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KUNTTZ TYTE PLASMA KALLIKREIN INHIBITORS 

Field of the Inventipn 

This invention relates to novel polypeptides which comprise at least one Kunitz-type domain having 
plasma kallikrein inhibitory activity. The invention further relates to the DNA encoding these novel 
5 polypeptides, and the recombinant materials and methods for producing these plasma kallikrein inhibitors. The 
invention also relates to pharmaceutical compositions containing the novel plasma kallikrein inhibitors for the 
treatment of diseases and disorders where the inhibition of plasma kallikrein is indicated. 

pespriiptfon, of Related Art 

Kallikrein 

1 0 Kallikrein is a serine protease of the multicomponent coagulation cascade that participates in the contact 

system of the intrinsic pathway of blood coagulation. Kallikrein also cleaves high molecular weight kininogen 
(HMWK) to form bradykinin (a potent vasodilator and endothelial cell activator), can activate prourokinase and 
plasminogen (fibrinolytic), and feeds back for reciprocal activation of surface bound Factor XII to Factor Xlla. 
In addition, it can also stimulate neutrophils causing the release of elastase. Both Factor Xlla and kallikrein can 

15 lead to plasmin generation causing fibrinolysis. Thus, although it plays a central role in the contact activation 
pathway, plasma kallikrein is involved in both fibrin deposition and lysis, modulation of blood pressure, 
complement activation and support of the inflammatory system. For a review of the contact activation pathway 
and kallikrein-kinin system see Bhoola, KJD., et al., (1992), Pharmacological Rev., 44(1): 1-80; and Wachtfogel, 
Y.T., (1993), Thromb. Res., 72:1-21. 

2 0 Prekallikrein, the precursor of kallikrein, is a glycoprotein comprised of a single polypeptide chain with 

a molecular weight of 80,000 Da and is present in normal plasma at a concentration of ca. 50 ug/ml (600 nM). 
In blood, 75 % of prekallikrein circulates bound to high molecular weight kininogen (HMWK). Kallikrein 
consists of 2 disulfide bonded chains of 43,000 and 33,000-36,000 Da, The light chain of kallikrein contains the 
enzymatic domain while the heavy chain appears to be required for surface dependent activation of coagulation. 

2 5 Because of its role in a diverse array of biological functions, regulation of plasma kallikrien as a form 

of therapeutic intervention has been extensively studied. 
Contact Activation Pathways in Disease 

Contact activation is a surface mediated pathway responsible in part for the regulation of inflammation 
and coagulation. The proteins involved in this pathway include Factor XII (Hageman Factor), prekallikrein 

30 (Fletcher Factor), high molecular weight kininogen (HMWK), and CI inhibitor (Schmaier, A. H. et al., in 
Hemostasis and Hirombosis: Basic Principles and Clinical Practice (Colman, R. W., Hirsh, J., Marder, V;, & 
Salzman, E. W., Eds.) 1987, pp 18-38, J. B. Lippincott Co., Philadelphia). The zymogens Factor FXII and 
prekallikrein are converted into active serine proteases as initial events in this pathway. The involvement of this 
plasma protease system has been suggested to play a significant role in a variety of clinical manifestations 

35 including septic shock, adult respiratory distress syndrome (ARDS), disseminated intravascular coagulation 
(DIC) and various other disease states (Coleman R. W. (1989) N. Engl. J. Med 320:1207-1209; Bone, R. C. 
(1992) Arch. Intern. Med. 152:1381-1389). 

The contact system of intrinsic coagulation and the complement system are excessively activated in 
sepsis and septic shock, especially in cases of fatal septic shock. The contact system can participate in the 
4 0 generation of many vasoactive mediators such as bradykinin, FXIIa, FXIIf and C5a, which are thought to play 
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a role in the pathogenesis of fatal shock. Bradykmin, FXIIa, and XUf are potent inducers of hypotension while 
C5a is an inducer of vasodilation and vasopeimeability. The levels of FXU, prekallikrein, and high molecular 
weight kininogen are decreased significantly during non-fatal shock, but are most severely depressed during fetal 
septic shock to approximately 30%, 57% and 27% of normal values respectively. These changes are noted 
5 regardless of whether the septic state is caused by gram positive or gram negative bacteria. 

The contact activation pathway is also involved in both fibrin deposition and lysis, as well as triggering 
neutrophil activation, activation of complement and modulation of blood pressure. 
Septic Shock 

Septic shock is the most common cause of death of humans in intensive care units in the United States 
10 (Parillo, J. E et at, (1990), Ann. Int. Med. 1 13227-242; Schmeichel C. J. & McConnick D., (1992) BioTechnol. 
1 0:264-267). It is usually initiated by a local nidus of infection that invades the blood stream. Incidences of 
sepsis and shock can arise from infections with either gram negative, gram positive bacterial or fungal 
microorganisms. All these organisms seem to induce a common pattern of cardiovascular dysfunction. In recent 
years aggressive fluid infusion therapy has been accepted as a primary means of treatment for septic shock. 
IS Adequate repletion of fluid is associated with an elevated cardiac output and low vascular resistance. Despite 
treatment, septic shock results in a severe decrease in systemic vascular resistance and generalized blood flow 
maldistribution. Aggressive therapy reverses shock and death in about 50% of the cases. Unresponsive 
hypotension resulting from a very low vascular resistance cannot be corrected by fluid infusion. Among those 
subjects that die from septic shock, approximately 75% die from persistent hypotension and the remainder due 
20 to multiple organ system failure. 

The increase in cardiac output and vasodilation in septic shock is attributed to the action of 
inflammatory mediators. In septic shock, components of the kallikrein-kinin system are depleted suggesting 
activation of this system. This is not the case in cardiogenic shock suggesting that the kallikrein-kinin system 
is a key player in septic shock (Martinez-Brotons F. et al., (1987) Thromb. Haemostas. 58:709-713). While the 
actual events leading to septic shock, DIC and hypotension have not been established, the known interactions 
among various components of the many physiological systems suggest that activation of the contact pathway may 
lead to a state of septic shock, multiorgan failure, and death (Bone, R. C, supra) as illustrated in Figure 1. 
ARDS 

ARDS is a complex pulmonary disorder affecting 150,000 people in the U. S. yearly with a 50 % 
mortality rate. Leukocytes, platelets and the proteolytic pathways of coagulation and complement mediate ARDS. 
ARDS involves activation of the contact activation pathway and depletion of C 1 inhibitor. Sepsis induced ARDS 
results in more severe DIC and fibrinolysis, more fibrin degradation products and reduced ATIII levels compared 
to trauma induced ARDS (Carvalho, A. C. et al., (1988) J. Lab. Clin. Med. 1 12:270-277). 
Disseminated Intravascular Coagulation 

Disseminated intravascular coagulation (DIC) is a disorder that occurs in response to tissue injury and 
invading microorganisms characterized by widespread deposition of fibrin and depleted levels of fibrinogen 
(Muller-Berghaus, G. (1989) Semin. Thromb. Hemostasis, 15:58-87). There are prolonged prothrombin and 
activated partial thromboplastin times. DIC has been observed in the clinical settings of a wide variety of diseases 
(Fruchtman, S. M. & Rand, J. H. in Thrombosis in Cardiovascular twh,^ Fuster, V. & Verstraete M. eds., 
40 (1992) pp 501-513 W.B. Saunders, Philadelphia). 
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Hypotension, DIC, and neutrophil activation are all triggered by the interaction of Factor Xlla, plasma 
kininogens and kallikrein. Deficiency of any of these 3 proteins does not give rise to hemostatic disorders due 
to redundancy in the system due to platelets, other coagulation factors, and endothelial cells. 
Plasma Kallikrein tnhlhi^ m 

> A large number of therapeutic approaches to septic shock and related disorders have been identified 

including various cytokine antagonists, Mabs (to endotoxin, tissue factor, tumor necrosis factor (TNF), 
neutrophils, etc.), kinin antagonists, bacteriocidal permeability increasing protein, PAF antagonists, C 1 inhibitor, 
DEGR-FXa, and activated protein C, among others. It is possible, due to the complicated nature of the disease, 
that an approach that involves multiple agents or agents that affect multiple pathways may be successful in the 
treatment of septic shock (Schmeichel C. J. & McCormick D., aipja). Potent serine protease inhibitors that 
reversibly inhibit proteases of the coagulation, contact activation, fibrinolysis, inflammation, complement 
activation, and hypotensive pathways are one approach to the treatment of diseases that are affected by these 
pathways. 

Protein inhibitors play critical roles in the regulation of proteases in a wide variety of physiological 
processes. The major physiological inhibitor of plasma kallikrein is CI inhibitor, a serpin which results in 
irreversible inhibition. CI inhibitor is also the major physiological inhibitor of FXIIa, and the complement 
pathway proteases Clr and Cls. • cc2-macroglobulin, another major inhibitor of kallikrein, inhibits the kinin- 
forming function whUe only partially inhibiting esterolytic activity. Antithrombin-in can also inhibit kallikrein, 
but slowly even in the presence of heparin. a2-antiplasmin and o 1 -antitrypsin are poor inhibitors of kallikrein. 
2 0 A mutant form of a r proteinase inhibitor (a ,-proteinase inhibitor-Pittsburgh) that contains an Arg in the P, 
position and an Ala in the P 2 position has been shown to be a more potent inhibitor of Factor Xllf (FXIIf) and 
kallikrein compared to C 1 inhibitor, the most potent known natural inhibitor of these proteases (Schapira, M. 
• et al., (1987) J. Clin. Invest 80:582-585; Patston, P. A. et al., (1990) J. Biol. Chem. 265: 10786-10791). Rats 
treated with this mutant were partially protected from the hypotension resulting from injection of FXIIf. 

Recently, ecotin, a 142 residue protein from E. coli , has been shown to potently inhibit plasma kallikrein with 
aKjofca. 1 60 P M; however ecotin is not totally selective and also potently inhibits Factor Xlla, Factor Xa and 
human leukocyte elastase (Seymour, J.L., et al., (1994) Biochemistry 33:3949-3958). 

Bovine pancreatic trypsin inhibitor (BPTI, aprotinin) is a well-studied member of the Kunitz domain 
family of serine protease inhibitors that moderately inhibits plasma kallikrein with Kj of ca. 30 nM (Fritz, H., 
and Wunderer, G., (1983) Arzeim.-Forsch. Drug Res. 33:479-494; Creighton, T.E., and Charles, I.G. (1987) 
Cold Spring Harbor Symp. Quant BioL 52:51 1-519). However, BPTI is a more potent inhibitor of plasmin. 

Aprotinin has been used in a wide variety of clinical states including acute pancreatitis, septic and 
hemorrhagic shock, adult respiratory distress syndrome and multiple trauma; recently it has shown promise both 
clinically and in models of cardiopulmonary bypass (Westaby, S., (1993) Ann. Thorac. Surg. 55:1033-1041 ; 
35 Watchfogel, T.,etal.,(1993)J. Thorac. Cardiovasc. Surg. 106:1-10). 

As a broad spectrum Kunitz type serine protease inhibitor, aprotinin can prevent activation of the 
clotting cascade initiated by the cqntact activation pathway. It can also prevent activation of neutrophils and 
other inflammatory responses resulting from tissue damage caused by ischemia and hypoxia. These benefits are 



25 



30 



-3- 



W0963951 9 [file:/A\dcwasQ3\firmdata\lp\FoleyPat\PatentDocuments\WQ963951 9.CPC ] _ Pa g e 6 of 71 

WO 96/39519 PCT/US96/09059 

believed to be derived from its kallikrein or plasmin inhibitory activity; however, the feet that aprotinin is neither 
very potent nor selective make it difficult to interpret these effects. 

Aprotinin inhibits the contact, neutrophil, and platelet activation systems during simulated 
extracorporeal perfusion as evidenced by a reduction in blood loss, prevention of neutrophil degranulation, 
5 platelet activation and aggregation, and formation of kallikrein-Cq-inhibitor and Cl-Cl inhibitor complexes 
(Watchfogel, et al., (1993) J. Thorax. Cardiovasc. Surg. 106:1-10). It has been used to inhibit plasma kallikrein 
during LPS induced endotoxic shock in pigs with the result of preventing arterial hypotension (Seibeck, M. et 
aJ.,(1993)J.Trauma34:193-198). Aprotmto has also been used to mvesti^^^ 

kallikrein-kinin system in the haemodynamic and renal function in patients with hepatic cirrhosis (MacGilchrist, 
10 A., (1994) Clin. Sci. 87:329-335). Aprotinin has been shown to be beneficial to renal hydrodynamics resulting 

in improved renal flow and filtration. 

Traysylol® (aprotinin, Bayer AG, Leverkusen, Germany) is currently indicated to inhibit the contact 

system of plasma which is massively activated on the first passage of blood through the cardiopulmonary bypass 

circuit during bypass procedures (Westaby, S. (1993) Ann. Thorac. Surg. 55: 1033-4 1). Aprotinin blocks contact 
15 activation of the kallikrein system during cardiopulmonary bypass and acts in synergy with heparin in preventing 

thrombus formation through inhibition of the intrinsic clotting cascade (Westaby, supra l Trasylol® has also 

been used prophylactically in septic shock, hemorrhagic shock, post operative necrotic pancreatitis, and post 

operative pulmonary embolism. 

2 0 Kunitz-type serine protease inhibitors (BPTI, Aprotinin, for example) are a well characterized family 

of proteins that exhibit extensive structural homology including a characteristic tertiary fold containing an 
extended binding loop that fits into the active site of the cognate serine protease (Bode, W., and Huber, R., 
(1992) Eur. J. Biochem. 204:433-451). Kunitz type serine protease inhibitors are known to be slow, tight- 
binding, reversible.inhibitors of serine proteases that bind to the active site and inhibit according to the standard 
25 mechanism (Laskowski, Jr., M. and Kato, I., (1980) Ann. Rev. Biochem. 49:593-626). Cleavage between the 
Pj and P,' residues occurs very slowly if at all (Bode, W. and Huber, R., (1992) Eur. J. Biochem. 204:433-45 1 ; 
Laskowski, M, Jr. and Kato, I., (1980) Annu. Rev. Biochem. 49:593-626; The ? { residue refers to the position 
preceding the scissile peptide bond of the substrate or inhibitor and fits into the Sj binding site as defined by 
Schecter and Berger (1967) Biochem. Biophys. Res. Commun., 27: 1 57-162. The residue numbering corresponds 
30 to that of BPTI such that residue 15 is at the Pj position). 

Some of the contact residues in the binding loop (positions 1 1, 15, 17, and 19) are relatively variable 
among Kunitz domains (Creighton, T. E. and I. G. Charles, (1987) Cold Spring Harbor Symp. Quant. Biol. 
52:51 1-519). Position 13 is normally a Pro. Position 12 is almost always a Gly. The cysteine residues at 
positions 14 and 38 that form a disulfide bond are always found in Kunitz domains; however other residues such 
35 as Ala, Gly, Ser, or Thr may substitute for the cysteines (Marks, C. B. et al., (1987) Science, 235: 1370-1373). 

In the 58 residue Kunitz protease inhibitor domain of the Alzheimer's amyloid B-protein precursor 
(APPI) and other Kunitz domains, residues 13 and 39 as well as residues 1 7 and 34 are in close proximity (Figure 
3) (Hynes, T. R. et al., (1990)Biochemistry 29: 10018-10022). Residues at positions 16 and 18 are generally 
more invariant among Kunitz domains (Creighton, T. E. and I. G. Charles, (1987) Cold Spring Harbor Symp. 
40 Quant. Biol. 52:5 11 -51 9); howeveY, different residues at these positions may also alter binding. Therefore, 
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residues at positions 1 1 tbrough 19, 34, 38, and 39 may all affect the binding affinity and specificity towards 
serine proteases (Figure 3). 

Other residues are important as well. For instance, APPI and BPTI have a methionine at position 52, 
although other Kunitz domains have a variety of residues at this position (Figure 2). Methionine at this position 
5 can be replaced by different residues which may be beneficial with respect to producing the protein. For example, 
methionine is susceptible to oxidation to form methionine sulfoxide, which can complicate purification. Also 
protein can be made recombinantly as a fusion protein, followed by cleavage with CNBr, which cleaves at 
methionine residues (Auerswald, E. A. et al., (1988) Biol. Chem. Hoppe-Seyler 369: 27-35). Therefore, it is 
necessary to remove other methionine residues in the protein of interest to produce intact product Substitutions 

10 at position 52 are not expected to have major effects on inhibitory activity since it is so far away from the primary 
binding loop of the Kunitz domain (Figure 3). 

Substrates and inhibitors of trypsin-like proteases such as kallikrein and FXIa have either Arg or Lys 
at the Pj residue (EP O 339 942 A2). However methionine is sometimes found at the Pj position and may also 
be preferable for good inhibition of serine proteases (McGrath, M. E. et al, (1991) J. Biol. Chem. 266:6620- 

15 6625). The introduction of residues such as Val, Leu, or He at the P j position of Kunitz domains leads to potent 
inhibitors of human leukocyte elastase (HLE) and concomitant loss of the wild type inhibitory activity 
(Beckmann, J. et al., (1991) Eur. J. Biochem. 176:675-682; Sinha, S. et al.,(1991) J. Biol. Chem. 266:2101 1- 
21013). 

Recently, APPI, which is structurally similar to BPTI (Hynes, T., et al., (1990) Biochemistry 29:10018- 

2 0 10022), was used as a scaffold for phage display of a large library of variants to select potent and specific active 

site inhibitors of tissue fector-Factor Vila (TF-FVIIa) (Dennis, M.S., and Lazarus, R.A., (1994) J. Biol.Chem. 
269:22129-22136; Dennis, M.S.,. and Lazarus, R.A., (1994)J. Biol. Chem. 269:22137-22144). A direct 
comparison of variants produced in that study against tissue factor-Factor Vila showed a 95-fold increase in 
binding affinity for plasma kallikrein by changing the Pj residue from Lys to Arg. The data were not inconsistent 
2 5 with those presented earlier for BPTI where Arg and Lys were studied at position Pj (Scott, C.F., et al., Blood 
69:1431-1436). In this study, a change of Lys to Arg at position Pj resulted in a 20 fold more potent inhibitor 
of plasma kallikrein. Neither of these studies suggested a specific inhibitor of plasma kallikrein since the 
proteases demonstrated inhibition of other serine proteases such as tissue factor-Factor Vila, Factor XIa or 
plasm in. 

30 APPI has been readily expressed in bacteria such as E. coll (Castro, M. et al., (1990) FEBS Lett. 

267:207-212) and yeast such as P. pastoris. The x-ray crystal structure of the protein is known (Hynes, T. R. 
et al., (1990) Biochemistry 29:10018-10022). Additionally, it is derived from a human sequence, which would 
minimize the immunogenicity for any therapeutically useful variants. 

Despite recent advances, there remains a need for potent specific inhibitors of plasma kallikrein for the 

3 5 development of targeted therapeutic tools. 

Surn^arvvfthe tnyentiQn 
The present invention provides for compositions capable of potent reversible inhibition of proteases 
of the coagulation, contact activation, fibrinolysis, inflammation, complement activation, and hypotensive 
pathways. In particular the invention provides for the potent inhibition of plasma kallikrein. The compositions 
40 of the present invention comprise a serine protease inhibitor polypeptide. The polypeptide inhibitor of the 
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present invention comprises at least one non-native Kunitz-type serine protease inhibitor domain capable of the 
potent inhibition of plasma kallikrein. 

In a preferred embodiment the invention provides for a polypeptide comprising at least one non-native 
Kunitz-type serine protease inhibitor domain said Kunitz-type serine protease inhibitor domain having a primary 
5 binding loop: 

Xaa 5 -Xaa 4 -Xaa3-Xaa2-Xaa ^ Xaa 1 , -Xaa 2 , -Xaa3 , -Xaa 4 , 
and a secondary binding loop comprising: 
Xaajg* 

wherein Xaa 5 is an amino acid selected from the group consisting of Pro, Asp, Glu, Ser, Thr, Arg, and Leu; Xa^ 
is the amino acid Gly; Xaa 3 is an amino acid selected from the group consisting of His, Pro, Arg, Leu, Gly, and 
Thr; Xaa2 is Cys; Xaaj is Arg; Xaa^ is an amino acid selected from the group consisting of Ala and Gly; Xaa^ 
is an amino acid selected from the group consisting of Ala, Leu, Asn, Trp and Ser; Xaa 3 ' is an amino acid 
selected from the group consisting of His and He; Xaa 4 ' is selected from the group consisting of Pro, Tyr, Leu, 
and Trp; and Xaa 19 ' is an amino acid selected from the group consisting of Val, Tyr, Trp, Ser, and Phe. 

In a further preferred aspect of the present invention the polypeptide comprises a Kunitz-type serine 
protease inhibitor domain having the sequence: 
R r Xaa 5 -Xaa 4 -Xaa 3 -Xaa2^ 

where R { is a 10 amino acid peptide and the amino acid corresponding to amino acid position Xaaj j is Cys; R2 
is a 14 amino acid peptide wherein the amino acid corresponding to amino acid position Xaa 15 * is Cys; and R3 
is a 24 amino acid peptide wherein the amino acid corresponding to amino acid positions Xaa2 3 r , Xaa 36 ' and 
Xaa 40 ' are Cys. According to this aspect of the present invention the polypeptide of the present invention 
comprises a Kunitz-type domain of approximately 58 amino acids having a primary binding loop comprising 
amino acids Xaa 5 -Xaa 4 -Xaa 3 -Xa^ and a secondary binding loop comprising 

amino acid Xaa 19 ' as defined above for the invention. 
2 5 Exemplary Kunitz type domains have an Rj selected from the group consisting of 

pVREVCSEQAE (SEQ ID NO: 6), 
MHSFCAFKAD (SEQ ID NO: 7), 
KPDFCFLEED (SEQ ID NO: 8), 
GPSWCLTPAD (SEQ ID NO: 9), 
30 KEDSCQLGYS (SEQ ID NO: 10), 

TVAACNLPIV (SEQ ID NO: 11), 
LPNVCAFPME/SEQ ID NO: 12), and 
RPDFCLEPPY (SEQ ID NO: 13); 
R2 is selected from the group consisting of 
35 RWYFDVTEGKCAPF (SEQ ID NO: 14), 

RFFFNIFTRQCEEF (SEQ ID NO: 15), 
RYFYNNQTKQCERF (SEQ ID NO: 16), 
RFYYNSVIGKCRPF (SEQ ID NO: 17), 
RYFYNGTSMACETF (SEQ ID NO: 18), 
4 0 LWAFDAVKGKCVLF (SEQ ID NO: 19), 

-6- 
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KWYYDPNTKSCARF (SEQ ID NO: 20), 
RWFFNFETGECELF (SEQ ID NO: 21), and 
RYFYNAKAGLCQTF (SEQ ID NO: 22); and 
R 3 is selected from the group consisting of 
5 YGGCGGNRNNFDTEEYCAAVCGSA (SEQ ID NO: 23), 

YGGCGGNRNNFDTEEYCMA VCGSA (SEQ ID NO: 24), 
YGGCEGNQNRFESLEECKKMCTRD (SEQ ID NO: 25), 
YGGCLGNMtiNFETLEECKNICEDG (SEQ ID NO: 26), 
YSGCGGNENNFTSKQECLRACKKG (SEQ ID NO: 27), 
1 0 YGGCMGNGNNFVTEKECLQTCRTV (SEQ ID NO: 28), 

YGGCQGNGNKFYSEKECRE YCG VP (SEQ ID NO: 29), 
YGGCGGNENKFGSQKECEKVCAPV (SEQ ID NO: 30), 
YGGCGGNSNNFLRKEKCEKFCKFT (SEQ ID NO: 3 1 ), and 
YGGCRAKRNNFKS AEDCMRTCGGA (SEQ ID NO: 32). 
1 5 Preferred polypeptides also include polypeptides that inhibit plasma kallikrein and comprise a Kunitz 

type serine protease inhibitor domain having a primary and secondary binding loop as described above and 
wherein Rj represents amino acid residues 1-10 of APPI (SEQ ID NO: 6) or conservative amino acid 
substitutions thereof; R2 represents amino acid residues 20-33 of APPI (SEQ ID NO: 14) or conservative amino 
acid substitutions thereof; and R3 represents amino acid residues 35 through 58 of APPI (SEQ ID NO: 23) or 
2 0 conservative amino acid substitutions thereof 

In a preferred aspect the polypeptide comprises a Kunitz-type serine protease inhibitor domain having 
a primary binding loop Xaa 5 -Xaa 4 -Xaa3-Xaa 2 -Xaa 1 -Xaa 1 , -Xaa2 , -Xaa3 , -Xaa 4 , wherein Xa^ is an amino acid 
selected from the group Glu, Asp, and Pro; Xaa 4 is Gly; Xaa 3 is His; Xaa2 is Cys; Xaaj is Arg; Xaa^ is Ala; 
Xaa2' is Alia; Xaa 3 ' is His; and Xaa 4 ' is Pro; and a secondary binding loop comprising Xaa J9 ' where Xaa 19 ' 
25 is selected from Val, Trp and Tyr. 

According to the present invention, the serine protease inhibitors are capable of the potent inhibition 
of plasma kallikrein. Therefore, a preferred polypeptide within the context of the present invention has an 
apparent dissociation constant (Kj*) for plasma kallikrein of less than about 500 picomolar (pM). More 
preferably, the polypeptides of the present invention have a Kj for plasma kallikrein of less than about 300 pM 
30 and most preferably less than about 1 00 pM. In a preferred aspect of the present invention the serine protease 
inhibitors are capable of the potent and specific inhibition of plasma kallikrein. According to this aspect of the 
present invention the serine protease inhibitors inhibit plasma kallikrein and are not capable of appreciable 
inhibition of other serine proteases of the coagulation cascade such as Factor Xa, tissue factor-Factor Vila, 
thrombin, Factor XHa, or activated protein C. 
35 In a further embodiment, the present invention encompasses a composition of matter comprising isolated 

nucleic acid, preferably DNA, encoding the polypeptide of the invention. The invention further comprises an 
expression control sequence operably linked to the DNA molecule, an expression vector, preferably a plasmid, 
comprising the DNA molecule, where the control sequence is recognized by a host cell transformed with the 
vector, and a host cell transformed with the vector. 
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Prefenred expression vectors of the present invention may be selected from, for example; pBR322, 
phGHl, pB0475, pRIT5, pRTHT, pKK233-2, pDR540, and pPL-lambda, with the most preferred vector being 
pSAlzl. 

Preferred host cells containing the expression vector of the present invention may be selected from, for 
5 example, E coli K12 strain 294 (ATCC No. 3 1446), E coli strain JM101, K coli B, E coli X1776 (ATCC No. 
3 1537X E coli c600, E coli W3 1 10 (F-, gamma-, prototrophic, ATCC No. 27325), Bacillus subtilis, Salmonella 
typhimurium, Serratia marcesans, and Pseudomonas species, with the most preferred host cell being E. coli 
W31 10 (ATCC No. 27325), or a derivative thereof such as the protease deficient strain 27C7 (ATCC No. 
55244). 

10 The composition of the present invention may be made by a process which includes the steps of 

isolating or synthesizing nucleic acid sequences encoding any of the amino acid sequences described above, 
ligating the nucleic acid sequence into a suitable expression vector capable of expressing the nucleic acid 
sequence in a suitable host, transforming the host with the expression vector into which the nucleic acid sequence 
has been ligated, and culturing the host under conditions suitable for expression of the nucleic acid sequence, 
15 whereby the protein encoded by the selected nucleic acid sequence is expressed by the host. Preferably, the 
polypeptide is then recovered from the host cell culture. In this process, the ligating step may further 
contemplate ligating the nucleic acid into a suitable expression vector such that the nucleic acid is operabry 
linked to a suitable secretory signal, whereby the amino acid sequence is secreted by the host. The secretory 
signal may be selected from the group consisting of the leader sequence, for example, of stll, lamB, herpes gD, 
2 o Ipp, alkaline phosphatase, invertase, and alpha factor and is preferably stll. 

The present invention further extends to therapeutic applications for the compositions described herein. 
Thus the invention includes a pharmaceutical composition comprising a pharmaceutical^ acceptable excipient 
and the purified polypeptide of the invention. 

Those applications include, for example, a method of treating a mammal for which inhibiting plasma 

2 5 kallikrein is indicated comprising administering a pharmaceutically effective amount of the pharmaceutical 

composition to the mammal. Such indications include; inflammation, septic shock, hypotension, ARDS, DIC, 
cardiopulmonary bypass surgery, and bleeding from postoperative surgery. 

Description of the Drawings 
Figure 1. Schematic outline of selected enzymes and mediators that modulate the coagulation, contact, 

3 0 fibrinolytic, inflammatory, and complement pathways. Activation of these pathways can lead to the clinical states 

indicated. 

Figure 2. Sequence alignment of Kunitz domains from mammalian sources. Aligned are, APPI (residues 
1-58) (SEQ ID NO: 34) from human Alzheimer's disease amyloid P-protein precursor, residues 287-344 (Castro, 
M. et al. (1990) FEBS Lett 267:207-212); TFPI-KD1 (Kunitz domain l)(residues 22-79) (SEQ ID NO: 35), 

35 TFPI-KD2 (residues 93-150) (SEQ ID NO: 36X and TFPI-KD3 (residues 1 85-242) (SEQ ID NO: 37) of human 
TFPI (tissue factor protein inhibitor or LACI, Broze Jr., G. J. et al., (1990) Biochemistry 29:7539-7546); ITI- 
KD1 and ITI-KD2, (residues 22-79 and 78-135) (SEQ ID NO: 38 and 39) of human inter- a -trypsin inhibitor, 
respectively (Vetr, Ji et al., (1989) FEBS Lett. 245:137-140); Collagen a 3 (VI) (residues 2899-2956) (SEQ 
ID NO: 40) of Collagen a 3 (VI) chain precursor (Chu, M. L. (1990) et al. EMBO J. 9:385-393); HKIB9 (7-60) 

40 (SEQ ID NO: 41) Human Kunitz-type protease inhibitor, HKIB9 (Norris, K., in Genbank Database (Dec 3 1, 
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1993, Release 39.0), submitted Jan 19,1994); BPTI (1-58) (SEQ ID NO: 42), Aprotinin or bovine basic 
pancreatic trypsin inhibitor (Creighton T. E. and Charles, L G., (1987) Cold Spring Harbor Symp. Quant. Biol. 
52:51 1-5 19). A motif alignment of invariant residues is also listed. 

Figure 3. Model of APPI and other Kunitz domains. The numbers refer to the residues found in APPI 
5 and other Kunitz domains; residue 15 corresponds to the P, residue. The shaded area refers to the primary 
(residues 1 1-19) and secondary (comprising residue 34) binding loops of APPI and other Kunitz domains. 
Disulfide bonds between cysteine residues at positions 14 and 38, 5 and 55, and 30 and 51, are indicated as 
dashed lines. 

Figure 4 A and Figure 4 B. Determination of the apparent equilibrium dissociation constants of 
10 selected Kunitz domains with plasma kallikrein and FXIa. The inhibitory activity is expressed as the fractional 
activity (inhibited rate/uninhibited rate) at varying inhibitor concentrations. The apparent equilibrium 
dissociation constants were determined by nonlinear regression analysis of the data to equation 1. Shown in 
Figure 4 A is the fractional activity of 0.5 nM plasma kallikrein remaining in the presence of: APPI (•), BPTI 
(■), KALI-10 (o) and KALI-DY (□). Shown in Figure 4B is the fractional activity of 3.5 nM FXIa remaining 
15 in the presence of: APPI (•), BPTI (■), KALI-10 (o) and KALI-DY (□). Kj* values are reported in Tables III 
and IV. 

Figure 5A and Figure 5B. The concentration of APPI (•), BPTI (■) and KALI-DY (□) are plotted 
vs. the fold prolongation of clotting time upon initiation by ellagic acid in the APTT assay (Figure 5A) or by TF 
membranes in the PT assay (Figure 5B). The uninhibited clotting times were 33.6 sec and 14 sec for the APTT 
20 and PT, respectively. 

Detailed Description of the Invention 

I. Definitions 

Abbreviations used throughout the description include: FXIIa for Factor Xlla; HMWK for high 
molecular weight kininogen; FXIa for Factor XIa; FXa for Factor Xa; TF for tissue factor; FVIIa for Factor Vila; 

2 5 BPTI for basic pancreatic trypsin inhibitor, APPI for Alzheimer's amyloid 6- protein precursor inhibitor; K * for 

apparent equilibrium dissociation constant, BSA for bovine serum albumin; HPLC for high performance liquid 
chromatography; PT for prothrombin time; APTT for activated partial thromboplastin time. 

The terms "Kunhz-type serine protease inhibitor domain," 'Kunitz-type domain," "Kunitz domain," and 
the like are used interchangeably herein to refer to an approximately 58 amino acid residue protein domain 

3 0 characterized by a conservation of cysteine residues with the serine protease inhibitor BPTI (bovine pancreatic 

trypsin inhibitor; Creighton and Charles, (1987) Cold Spring Harbor Symp. Quant. Biol. 52:511-519) first 
isolated in crystalline form in 1936 (Kunitz, M., and Northop, J.H., (1936) J. Gen. Physiol. 19:991-1007). 
Kunitz domains share cysteine residue placement, tertiary folding, and structural characteristics. Figure 3 is a 
model of the tertiary structure of the Kunitz domain of Alzeheimer's amyloid B-protein precursor showing 

35 cysteine residue placement. 

A family of proteins has been identified containing one, two or three Kunitz domains. The family 
includes; LACI (lipoprotein-associated coagulation inhibitor, also TFPI or tissue factor protein inhibitor; Broze, 
Jr. G. J., et al., (1990) Biochemistry 29:7539-7546); APPI (Alzheimer's amyloid P-protein precursor; Hynes, 
T.R., (1990) BicKtoemistry 29:10018-10022); the a-3 chain of human type VI collagen (see WO 93/141 19) and 

40 mter-a-trypsm ^ Figure 2 presents 
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the sequence alignment of Kunitz domains from mammalian sources. Kunitz-type domains have also been 
identified in red sea turtle egg white, chelonianin (Kato, L, and Tominaga, N., (1 979) Fed. Proc. 38:3342-33 57), 
and B-chain of Bj bungarotoxin (Kondo, K, et al., (1982) J. Biochem. 91:1519. They have also been identified 
in many snake venoms. Kunitz domains contain six specifically spaced cysteines that are present naturally 
5 in disulfide bonds (Bode, W., and Huber, R., (1992) Eur. J. Biochem. 204:433-45 1). The three disulfide bridges 
stabilize the protein and are partially responsible for the overall 3-dimensional folding characteristic of a Kunitz 
domain (Figure 3). In the 58 residue Kunitz type serine protease BPH and APPI, cysteines are present at 
residues 5, 14, 30, 38, 5 1 , and 55 The removal of a one disulfide bridge, however is not accompanied by a large 
structural change (Eigenbrot, C, et al., (1990) Protein Eng. 3:591-598. 

10 The crystal structure of Kunitz domain type serine protease inhibitors has been determined for BPT1 

(SUEffi) and APPI (Hynes, T.R, et al, (1990) Biochem., 29:1 001 8-10022). A central anti-parallel three-stranded 
B-sheet and a C-terminal a-helix form the core of the domain (Bode, W., and Huber, W., sufita). The segments 
of the core domain form the supporting scaffold for the exposed primary binding loop of the properly folded 
protein (the "primary binding loop" as defined hereinXBode, W., and Huber, R., suoraV A secondary binding 

15 loop participates along with the primary binding loop to define the interface between the Kunitz domain and the 
cognate protease target (the "secondary binding loop" as defined herein)(R0hlman, A., et al., (1973) J. Mol. Biol. 
77:417-436). 

"Non-native" as used herein is meant to refer to Kunitz-type domains having an amino acid sequence 
which is different from the naturally occurring Kunitz domains. Examples of naturally occurring Kunitz domains 
20 are, for example, those described in Figure 2 and in the Description of Related Art. The amino acid sequence 
of the non-native Kunitz type domains of the present invention differ from the naturally occurring Kunitz domain 
at least by virtue of the modifications to the primary and secondary binding loops as described herein. In one 
embodiment the non-natural Kunitz type domain is derived from a naturally occurring Kunitz domain by 
substitution of one or more amino acids of the naturally occurring Kunitz domain. Such a modification may be 
made for example by altering the DNA sequence encoding a naturally occurring Kunitz domain. In some 
instances, the non-native Kunitz-type domain may be derived from a naturally occurring Kunitz type domain by 
direct chemical modification of one or more of the amino acid side chains of the naturally occurring Kunitz 
domain. 

The "primary binding loop" of a Kunitz domain is designated by ^ s'^^'^l'^r^ I m ^2 m ^3' p 4 
(residues 11-19 of BPH and APPI). The "secondary binding loop" is designated p l9 Lp 20 , - p 2^• p 22 , • p 23 , ~ p 24 , 
(residues 34-39 of BPTI and APPI). 

The term "Pj " is used herein to refer to the position preceding the scissile peptide bond of the serine 
protease inhibitors as previously defined by Schecter, I., and Berger, A., (1967) Biochem. Biophys. Res. 
Commun. 27: 157-162. Similarly, the term "Pf is used to refer to the position following the scissile. peptide 
bond of the inhibitor. Increasing numbers refer to the next consecutive position preceding (e.g., P 2 and P 3 ) and 
following (e.g., P 2 * and P 3 ') the scissile bond. The residue numbering corresponds to that of BPTI such that 
residue 15 is at the P| position. 

In the polypeptides of the invention the designations Xaa replace P in referring to amino acid positions, 
nierefore Xaaj is equivalent to Pj and Xaa j 1 is equivalent to Pj*. 
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APPI refers to the 58 amino acid polypeptide from human Alzeheimer's disease amyloid 0-protein 
precursor, residues 287-344 (Castro, et al., (1990) FEBS Lett. 267:207-212). In this protein P5-P4-P3-P2-P1- 
P1-P2 • p 3" p 4 t corresponds to residues 1 1-19 of the primary binding loop. P 19 * corresponds to residue 34 of 
the secondary binding loop. 

5 The term "amino acid" within the scope of the present invention is meant to refer to naturally occurring 

L alpha amino acids or residues. The commonly used one and three letter abbreviations for amino acids are used 

herein (Lehninger, AX., Biochemistry, 2d ed, pp. 71-92, (1975), Worth Publishers, New York). 

The term "conservative 11 amino acid substitution as used within this invention is meant to refer to amino 

acid substitutions which substitute functionally equivalent amino acids. Conservative amino acid changes result 

10 in silent changes in the amino acid sequence of the resulting protein. For example, one or more amino acids of 

a similar polarity act as functional equivalents and result in a silent alteration within the amino acid sequence of 

1 

the protein. Conservative amino acid substitution have been defined as the amino acid substitutions set forth in 
Table 1 on page 240 of Taylor, W.R., (1986) J. Mol. Biol. 188233-258. The largest sets of conservative amino 
acid substitutions include: 
15 ( 1 ) hydrophobic: His, Trp, Tyr, Phe, Met, Leu, He, Val, Ala; 

(2) neutral hydrophilic: Cys, Ser, Thr; 

(3) polar: Ser, Thr, Asn, Gin; 

(4) acidic/negatively charged: A$p, Glu; 

(5) charged: Asp, Glu, Arg, Lys, His 

20 (6) basic/positively charged: Arg, Lys, His; 

(7) basic: Asn, Gin, His, Lys, Arg; 

(8) residues that influence chain orientation: Gly, Pro; and 

(9) aromatic: Trp, Tyr, Phe, His. 

In addition structurally similar amino acids can substitute for some of the specific amino acids. Groups of 
2 5 structurally similar amino acids include: (lie, Leu, and Val); (Phe and Tyr); (Lys and Arg); (Gin and Asn); (Asp 
and Glu); and (Gly, and Ala). Exemplary conservative amino acid substitutions are preferably made in 
accordance with the following: Gly or Ser for Ala; Lys for Arg; Gin or His for Asn; Glu for Asp; Ser for Cys; 
Asn for Gin; Asp for Glu; Ala or Pro for Gly; Asn or Gin for His; Leu or Val for He; He or Val for Leu; Arg, Gin, 
or Glu for Met; Met, Leu or Tyr for Phe; Uir for Ser; Ser for Thr, Tyr for Trp; Trp or Phe for Tyr; He or Leu 
30 for Val. 

"Expression vector" refers to a DNA construct containing a DNA sequence which is operably linked 
to a suitable control sequence capable of effecting the expression of the protein encoded by the DNA in a suitable 
host. Such control sequences generally include a promoter to effect transcription, an optional operator sequence 
to control transcription, a sequence encoding suitable mRNA ribosome binding sites, and sequences which 
35 control termination of transcription and translation. The vector may be a plasmid, a phage particle or 
"phagemid", or simply a potential genomic insert 

Once transformed into a suitable host, the vector may replicate and function independently of the host 
genome, or may, in some instances, integrate into the genome itself. In the present specification, "plasmid", 
"vector" and "phagemid" are sometimes used interchangeably as the plasmid is the most commonly used form 
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of vector at present. However, the invention is intended to include such other forms of expression vectors which 
serve equivalent functions and which are, or which become, known in the art 

"Operably linked," when describing the relationship between two DNA or polypeptide sequences, 
simply means that they are functionally related to each other. For example, a presequence is operably linked to 
5 a peptide if it functions as a signal sequence, participating in the secretion of the mature form of the protein most 
probably involving cleavage of the signal sequence. A promoter is operably linked to a coding sequence if it 
controls the traiiscription of the sequence; a ribosome binding site is operably linked to a coding sequence if it 
is positioned so as to permit translation. 
Discovery and Preferred Embodiments 
10 The present inventors have discovered that replacement or substitution of certain key amino acids found 

at positions within and around the primary and secondary binding loops of Kunitz-type serine protease inhibitors 
can dramatically improve the potency of the inhibitors toward plasma kallikrein. The present invention therefore 
provides for polypeptides which comprise one or more non-native Kunitz-type domains that are designed to 
potently inhibit plasma kallikrein. 
1 5 According to the present invention, residues 1 1 - 19, and 34 corresponding to residues P5-*V P 3" P 2" P 1 - 

P r p 2 , " p 3" p 4' 31,(1 p 19 of a Kunitz-type serine protease inhibitor are selected from among the naturally 
occurring amino acids such that potent inhibition of plasma kallikrein is achieved. Of the many interactions 
between the serine protease subsites and the side chains in the primary binding loop of Kunitz-type domain serine 
protease inhibitors (P 5 -P 4 0 (Bode, W. and Huber, R., (1992) Eur. J. Biochem. 204:433-451 ; Laskowski, M., 
20 Jr. and Kato, L, (1980) Annu. Rev. Biochem. 49:593-626), the interactions of the Pj residue with the specificity 
pocket are energetically most important and therefore represent the primary specificity determinants (see Figure 
3). In a Kunitz type domain/serine protease complex, the side chain of residue 15 of the Kunitz-type domain fills 
the Pj position preceding the scissile peptide bond. According to a preferred aspect of the present invention, 
plasma kallikrein is potently inhibited by polypeptides comprising a Kunitz-type domain wherein an Arg is found 
25 atposition 15(P,). While preferred polypeptides comprise a Kunitz domain with Arg at position P, Lysisalso 
contemplated at P|. 

The crystal structures of Kunitz-type domains reveal other residues within the primary binding loop that 
are likely to make contact with the serine protease (Hynes, T. R. et al., (1990) ajpia; Bode, W. and Huber, R, 
(1992) supra; KossiakofT, A. A. (1993) et al., Biochem. Soc. Trans. 21:614-618). Although the amino acid at 
the Pj position generally dominates the affinity of inhibitors for the serine protease active site (Scott, C. F. et 
al., (1987) Blood 69:1431-1436; Laskowski, M, Jr. and Kato, I, (1980) sum; Beckmann, J. et al., (1988) Eur. 
J. Biochem. 176:675-682; Sinha, S. et al., (1991) J. Biol. Chem. 266:2101 1-21013), residues outside this region 
(1 1-14 and 16-19 as well as residue 34 of secondary binding loop) are also known to play a role in binding 
affinity and specificity towards serine proteases (KossiakofT, A. A. et al., (1993) supra; Roberts, B. L. et al., 
35 (1992) Proc Natl Acad Sci USA 89: 2429-2433). 

The present inventors have discovered that potent plasma kallikrein inhibitors result when, in addition 
to an Arg at position 15, amino acids residues at positions 1 1-14 and 16-19 in the primary binding loop are Ser, 
Thr, Arg, Leu, Asp, Pro or Glu at position 1 1 (P 5 ); Gly at position 12 (P 4 ); Gl y> Thr, His, Pro, Arg, or Leu, at 
position 13 (P 3 ); Ala or Gly at position 16 (Pf); Ser, Ala, Leu, Asn, or Trp, at position 17 (P 2 '); His or He at 
position 1 8 (P 3 -); and Pro, Tyr, Leu or Trp at position 19 (P 4 '). The Cys normally present at positions 14 (and 
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38) of a Kunitz type serine protease inhibitor is maintained. At the secondary binding loop at position 34 (Pj g 1 ) 
amino acids Phe, Val, Tyr, Tip and Ser are preferred. 

More preferably, potent plasma kallikrein inhibitors result when, in addition to an Arg at position 1 5, 
amino acid residues at positions 1 1-14 and 16-19 in the primary binding loop are Asp, Pro or Glu at position 1 1 
5 (P 5 ); Gly at position 12 (P^; His, Pro, Arg, or Leu, at position 13 (P 3 ); Ala or Gly at position 16 (P^; Ala, Leu, 
Asn, or Tip, at position 17 (P^; His or lie at position 18 (P 3 f ); and Pro, Tyr, Leu or Trp at position 19 (P 4 '). 
Hie Cys normally present at positions 14 (and 38) of a Kunitz type serine protease inhibitor is maintained. At 
the secondary binding loop at position 34 (Pjp 1 ) amino acids Val, Tyr, Trp and Ser are preferred. 

Hie present inventors have also discovered that the polypeptides comprising at least one non-native 
10 Kunitz type domain which resulted in increased potency for plasma kallikrein, also inhibited the Factor XIa 
serine protease found in the human plasma. By contrast, the non-natural Kunitz type serine protease inhibitors 
were not inhibitors of FXIIa, FXa, thrombin, TF-FVIIa, or activated protein C. Additionally, most of the selected 
Kunitz inhibitor variants inhibited plasmin only slightly however, moderate (>60%) inhibition was observed for 
selected variants. 

15 According to a preferred aspect of the present invention, the polypeptides of the present invention 

comprise at least one non native Kunitz type domain and inhibit plasma kallikrein but do not inhibit plasmin. 
According to this aspect of the present invention, the polypeptides comprise a Kunitz type serine protease 
inhibitor domain wherein position 1 1 is Glu, Asp or Pro; positions 12-19 are Gly, His, Cys, Arg, Ala, Ala, His, 
and Pro, respectively and position 34 of the secondary binding loop is Val, Tyr, or Trp. Especially preferred 

20 among this group of polypeptides are polypeptides which comprise a Kunitz type serine protease inhibitor 
domain wherein position 1 1 is Asp. 

Hie invention therefore provides for a polypeptide which comprises at least one Kunitz-type domain 
having a primary and secondary binding loop as described. Hie remaining residues of the Kunitz type domain 
are selected from the naturally occurring amino acids such that the overall tertiary structure of the Kunitz type- 

2 5 domain is maintained. Accordingly, the cysteine residues characteristic of the Kunitz type domain are generally 

maintained at positions 5, 14, 30, 38, 5 1 and 55. 

In a preferred embodiment of the present invention the residues flanking the primary and secondary 
binding loop of the variant Kunitz type domain are selected from those residues that make up naturally occurring 
Kunitz type domains. For instance, a typical Kunitz type domain consists of 58 amino acids. According to the 
30 present invention amino acid residues 1 1-19 and 34 are selected as described to achieve potent inhibition of 
plasma kallikrein. The flanking residues, i.e., residues 1-10, 20-33 and 35-58, in addition to maintaining the 
proper cysteine residues as described, may be selected from corresponding residues of naturally occurring Kunitz 
domains such that the proper three-dimensional structure and the desired activity is maintained. 

In preferred aspects of the present invention the flanking residues 1-10, 20-33 and 35-58 are selected 

3 5 from the corresponding flanking regions of other Kunitz type serine protease inhibitors well known in the art. 

Such Kunitz type serine protease inhibitors include those described in Figure 2. Therefore, in a preferred 
embodiment, residues 1-10 are selected from VREVCSEQAE (SEQ ID NO: 6), MHSFCAFKAD (SEQ ID NO: 
7), KPDFCFLEED (SEQ ID NO: 8), GPSWCLTPAD (SEQ ID NO: 9), KEDSCQLGYS (SEQ ID NO: 10), 
TVAACNLPIV (SEQ ID NO: 1 1), LPNVCAFPME (SEQ ID NO: 12), and RPDFCLEPPY (SEQ ID NO: 13); 
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residues 20-33 are selected from the group consisting of RWYFDVTEGKCAPF (SEQ ID NO: 14), 
RFFFNIFTRQCEEF (SEQ ID NO: 15), RYFYNNQTKQCERF (SEQ ID NO: 16), RFYYNSVTGKCRPF 
(SEQ ID NO: 17), RYFYNGTSMACETF (SEQ ID NO: 18), LWAFDAVKGKCVLF (SEQ ID NO: 19), 
KWYYDPNTKSCARF (SEQ ID NO: 20), RWFFNFETGECELF (SEQ ID NO: 2 1 ), and RYFYNAKAGLOQTF 
5 (SEQ ID NO: 22); and residues 35-58 are selected from the group consisting of 
YGGCX3GNRNNFDTEEYCAAVCGSA (SEQ ID NO: 23), YGGCGGNRNNFDTEEYCMAVCGSA (SEQ ID 
NO: 24), YGGCEGNQNRFESLEECKKMCTRD (SEQ ID NO: 25), YGGCLGNMNNFETLEECKNICEDG 
(SEQ ID NO: 26), YSGCGGNENNFTSKQECLRACKKG (SEQ ID NO: 27), 
YGGCMGNGNNFVTEKECLQTCRTV (SEQ ID NO: 28), YGGCQGNGNKFYSEKECREYCGVP (SEQ ID 
10 NO: 29), YGGCGGNENKFGSQKECEKVCAPV (SEQ ID NO: 30), YGGCGGNSNNFLRKEKCEKFCKFT 
(SEQ ID NO: 3 1), and YGGCRAKRNNFKSAEDCMRTCGGA (SEQ ID NO: 32) which correspond to the 
equivalent residues in the Kunitz type serine protease inhibitor domains of APPI (residues 1-58) (SEQ ID NO: 
34) from human Alzheimer's disease amyloid P-protein precursor, residues 287-344 (Castro, M. et al. (1990) 
FEBS Lett 267:207-212); TFPI-KD1 (residues 22-79) (SEQ ID NO: 35), TFPI-KD2 (residues 93-150) (SEQ 
15 ^0:36), and TFPI-KD3 (residues 1 85-242) (SEQ ID NO: 37) of human TTPI (tissue factor protein inhibitor 
or LACI, Broze Jr., G. J. et al., (1990) Biochemistry 29:7539-7546); ITI-KD1 and IH-KD2, (residues 22-79 and 
78-135) (SEQ ID NO: 38 and 39) of human inter-a-trypsin inhibitor, respectively (Vetr, H. et al., (1989) FEBS 
Lett. 245:137-140); Collagen a 3 (VI) (residues 2899-2956) (SEQ ID NO: 40) Collagen alpha 3 (VI) chain 
precursor (Chu, M. L. (1990) et al. EMBO J. 9:385-393); HKIB9 (7-60) (SEQ ID NO: 41) Human Kunitz-type 
20 protease inhibitor, HKIB9 (Norris, K., in Genbank Database (Dec 31, 1993, Release 39.0), submitted Jan 
19, 1994); BPTI (1-58) (SEQ ID NO: 42), Aprotinin, bovine basic pancreatic trypsin inhibitor (Creighton T. E. 
and Charles, I. G., (1987) Cold Spring Harbor Symp. Quant Biol. 52:51 1-519) as shown in Figure 2. 

According to a preferred aspect of the invention the residues for the primary and secondary binding loop 
as described above are presented in the context of the flanking regions 1-10 (SEQ ID NO: 6), 20-33 (SEQ ID 
25 NO: 14) and 35-58 (SEQ ID NO: 23) of APPI. 

It will be understood by those of skill in the art that conservative amino acid substitutions such as those 
described above can be made throughout the polypeptides as described herein keeping in mind that the resultant 
polypeptides have the ability to potently inhibit plasma kallikrein as defined herein. 

The skilled artisan will recognize that a Kunitz type domain can appear within a larger functional 
3 0 protein. Hie serine protease inhibitor TTPI, for instance, contains 3 Kunitz-type domains. Therefore, the present 
invention provides for a por/peptide which comprises at least one Kunitz type domain that can be expected to 
potently inhibit plasma kallikrein as discussed above. 
Determination of Equilibrium Dissociation Constant 

According to the present invention the polypeptides comprising one or more non native Kunitz type 
35 domains as defined are capable of the potent inhibition of plasma kallikrein. The present inventors have 
provided for the amino acids that are found in the primary and secondary binding loops of Kunitz type domain 
variants that can be expected to potently inhibit plasma kallikrein. 

Potent inhibition occurs when the polypeptide has an apparent dissociation constant (Kj*) for plasma 
kallikrein of less than about 500 picomolar (pM). More preferably, the polypeptides of the present invention 
40 have a Kj for plasma kallikrein of less than about 300 pM and most preferably less than about 100 pM. 
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Apparent equilibrium dissociation constants (K*) can be determined using methods derived for tight- 
binding inhibitors (Bieth, J., (1974) Proteinase Inhibitors, vol:463-469; Williams, J. W. and Morrison, J. F., 
(1979) Methods Enzymol 63:437-467), assuming enzyme and inhibitor form a reversible complex with a 1:1 
stoichiometry as has been observed for the interaction of Kunitz domains with serine proteases (Bode, W. and 
5 R. Huber, (1992)§upra; Laskowski, M., Jr. and I. Kato, (1980) Annu. Rev. Biochem. supraY The data are fit by 
nonlinear regression analysis to fequation 1 : 



g J _ 'V ^£oj *Sj<l E o] ^ V + K i *)'-(*• [Eg] « [Iq]) (1) 



where V/V Q is the fractional activity (steady-state inhibited rate divided by the uninhibited rate), [£J is the total 
plasma kallikrein active site concentration, and [I Q ] is the total inhibitor concentration. 

10 By measuring apparent K* values with other relevant serine proteases found in human plasma, the 

relative specificities of naturally occurring Kunitz type serine protease inhibitors such as APPI, BPTI as well as 
polypeptides which comprise one or more non-native Kunitz domain can be determined. To aliquots of serial 
diluted polypeptide inhibitor, either activated protein C, thrombin, FXa, FXIa, FXIIa, tissue factor-FVIIa or 
plasmin can be added After incubation and addition of the appropriate substrate, plots of fractional activity 

15 versus inhibitor concentration are generated as described in the Example sections. 

According to the present invention selected variants, purified by trypsin affinity chromatography and 
reverse phase HPLC, potently inhibit plasma kallikrein, with apparent equiUbrium dissociation constants (K,-*) 
of less than about 300 pM. 

As a particular example one such mutant, KALI-DY, which differed from APPI at 6 key residues 

2 0 (11 Asp, 13His, 17Ala, 18His, 19Pro and 34Tyr), inhibited plasma kallikrein with a Kj* = 15 ± 14 pM, 
representing an increase in binding affinity of more than 10,000-fold compared to the naturally occurring serine 
protease inhibitor, APPI. 

Similar to naturally occurring Kunitz serine protease inhibitors such as APPI, the variants also inhibited 
Factor XIa with high affinity, with K* values ranging from ca. 0.3 to 15 nM; KALI-DY inhibited Factor XIa 
25 withaKj = 8.2±3.5nM. KALI-DY did not inhibit plasmin, thrombin, Factor Xa, Factor Xlla, activated 
protein C, or tissue factor-Factor Vila. Consistent with the protease specificity profile, KALI-DY and other non- 
native inhibitors did not prolong the clotting time in a prothrombin time assay, but did prolong the clotting time 
in an activated partial thromboplastin time assay >3.5-fold at 1 uM. 
CUftttjng Assays 

30 The increased affinity of the polypeptide inhibitors of the present invention for plasma kallikrein relative 

to native Kunitz type serine protease inhibitors such as BPTI and APPI is reflected in their ability to prolong 
clotting time in an activated partial thromboplastin time (APTT) assay. Preferred polypetide inhibitors within 
the present invention prolonged clotting time in the activated partial thromboplastic time assay but not in the 
prothrombin time (PT)assay. As will be recognized by one skilled in the art, the APTT assay is an indication 

35 of a measure of the intrinsic pathway of coagulation. Therefore, in a preferred embodiment polypeptides 
according to the present invention prolong clotting time in the APTT assay by at least 1 fold and generally 
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between about 2 and 4 fold. In a preferred embodiment the inhibitors of the present invention prolong clotting 
time by a factor of 3.5 fold. 
Chemical Synthesis 

One method of producing the Kunitz domain variants involves chemical synthesis of the protein, 
5 followed by treatment under oxidizing conditions appropriate to obtain the native conformation, that is, the 
correct disulfide bond linkages. This can be accomplished using methodologies well known to those skilled in 
the art (see Kelley, R. F. and Winkler, M. E. in Genetic Engin eering Princip l es and Methods. (Setlow, J. K., 
ed.)., Plenum Press, N.Y., (1990) vol. 12, pp. 1-19; Stewart, J. M. and Young, J. D. (1984) Solid Phase Peptide 
Synthesis, Pierce Chemical Co. Rockford, IL). 
1 0 Polypeptides of the invention, especially those containing 58 amino acid residues, may be prepared 

using solid phase peptide synthesis (Merrifield, (1964) J. Am. Chem. Soc., 85:2149; Houghten, (1985) Proc. 
Natl. Acad. Sci. USA 82:5132).* Solid phase synthesis begins at the carboxy-terminus of the putative peptide 
by coupling a protected amino acid to a suitable resin, as shown in Figures 1-1 and 1-2, on pages 2 and 4 of 
Stewart and Young supra. 

15 In synthesizing polypeptides of this invention, the carboxyl terminal amino acid, wiA a -amino group 

suitably protected, is coupled to a chloromethylated polystyrene resin (see Figure 1-4, page 10 of Stewart and 
Young, supra). After removal of the a-amino protecting group with, for example, trifluoroacetic acid (TFA) 
in methylene chloride and neutralizing in, for example TEA, the next cycle in the synthesis is ready to proceed. 
The remaining a -amino- and, if necessary, side-chain-protected amino acids are then coupled 

2 0 sequentially in the desired order by condensation to obtain an intermediate compound connected to the resin. 

Alternatively, some amino acids may be coupled to one another forming a peptide prior to addition of the peptide 
to the growing solid phase polypeptide chain. 

The condensation between two amino acids, or an amino acid and a peptide, or a peptide and a peptide 
can be carried out according to the usual condensation methods such as azide method, mixed acid anhydride 
25 method, DCC (dicyclohexylcarbodiimide) method, active ester method (p-nitrophenyl ester method, BOP 
[benzotriazole-l-yl-oxy-tris (dimethylamino) phosphonium hexafluorophosphate] method, N-hydroxysuccinic 
acid imido ester method), and Woodward reagent K method In the case of elongating the peptide chain in the 
solid phase method, the peptide is attached to an insoluble carrier at the C-terminal amino acid. For insoluble 
carriers, those which react with the carboxy group of the C-terminal amino acid to form a bond which is readily 

3 0 cleaved later, for example, halomethyl resin such as chloromethy 1 resin and bromomethyl resin, hydroxymethy 1 

resin, aminomethyl resin, benzhydrylamine resin, and t-alkyloxycarbonyl-hydrazide resin can be used. 

Common to chemical syntheses of peptides is the protection of the reactive side-chain groups of the 
various amino acid moieties with suitable protecting groups at that site until the group is ultimately removed after 
the chain has been completely assembled Also common is the protection of he a-amino group on an amino acid 

35 or a fragment while that entity reacts at the carboxyl group followed by the selective removal of the a-amino- 
protecting group to allow subsequent reaction to take place at that location. Accordingly, it is common that, as 
a step in the synthesis, an intermediate compound is produced which includes each of the amino acid residues 
located in the desired sequence in the peptide chain with various of these residues having side-chain protecting 
groups. These protecting groups are then commonly removed substantially at the same time so as to produce 

40 the desired resultant product following purification. 
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The applicable protective groups for protecting the a -and e -amino side chain groups are exemplified 
by benzyloxycarbonyl (abbreviated Z), isonicotinyloxycarbonyl (iNOC), O-chlorobenzyloxycarbonyl [Z(N0 2 ], 
p^methoxybenzyloxycarbonyl [Z(OMe)J, t-butoxycarbonyl, (Boc), t-amyioxycarbonyl (Aoc), 
isobornyloxycarbonyl, adamatyloxycarbonyl, 2-(4-biphenyl)-2-propyloxycarbonyl (Bpoc), 9- 
5 fluorenylmethoxycarbonyl (Fmoc), methylsulfonyiethoxycarbonyl (Msc), trifluoroacetyl, phthalyl, formyl, 2- 
nitrophenylsulphenyl (NPS), diphenylphosphinothioyl (Ppt), dimethylophosphinothioyl (Mpt) and the like. 

As protective groups for the carboxy group there can be exemplified, for example, benzyl ester (OBzi), 
cyclohexyl ester (Chx), 4-nitrobenzyl ester (ONb), t-butyl ester (Obut), 4-pyridybnethyl ester (OPic), and the 
like. It is desirable that specific amino acids such as arginine, cysteine, and serine possessing a functional group 
10 other than amino and carboxyl groups are protected by a suitable protective group as occasion demands. For 
example, the guanidino group in ajginine may be protected with nitro, p-toluenesulfonyl, benzyloxycarbonyl, 
adamantyloxycarbonyl, p-memoxybenzenesulfonyl, 4-methoxy-2, 6-dimethylbenzenesulfonyl (Mds), 1,3,5- 
trimethylphenysulfonyl (Mts), and the like. The thiol group in cysteine may be protected with p-methoxy benzyl, 
triphenyhnethyl, acetylaminomethyl ethylcarbamoyl, 4-methylbenzyl, 2, 4, 6-trimethy-benzyl (Tmb) etc, and the 
15 hydroxyl group in the serine can be protected with benzyl, t-butyl, acetyl, tetrahydropyranyl, etc. 

Stewart and Young, syjaa provides detailed information regarding procedures for preparing peptides. 
Protection of a-amino groups is described on pages 14-18, and side-chain blockage is described on pages 1 8-28. 
A table of protecting groups for amine, hydroxyl and sulfhydryl functions is provided on pages 149-151. 

After the desired amino acid sequence has been completed, the intermediate peptide is removed from 
20 the resin support by treatment with a reagent, such as liquid HF and one or more thio-containing scavengers, 
which not only cleaves the peptide from the resin, but also cleaves all the remaining side-chain protecting groups. 
Following HF cleavage, the protein sequence is washed with ether, transferred to a large volume of dilute acetic 
acid, and stirred at pH adjusted to about 8.0 with ammonium hydroxide. 

Preferably in order to avoid alkylation of residues in the polypeptide, (for example, alkylation of 

2 5 methionine, cysteine, and tyrosine residues) a thio-cresol and cresol scavenger mixture is used. The resin is 

washed with ether, and immediately transferred to a large volume of dilute acetic acid to solubilize and minimize 
intermolecular cross-linking. A 250 uM polypeptide concentration is diluted in about 2 liters of 0.1 M acetic 
acid solution. The solution is then stirred and its pH adjusted to about 8.0 using ammonium hydroxide. Upon 
pH adjustment, the polypeptide takes its desired conformational arrangement. 

3 0 Kunitz domains can be made either by chemical synthesis, described above, or by semisynthesis. The 

chemical synthesis or semisynthesis methods of making allow the possibility of non-natural amino acid residues 
to be incorporated This has been carried out for Kunitz domains and described (Beckmann, J. et al., (1988) Eur. 
J. Biochem. 176: 675-682; Bigler, T. L. et al., (1993) Prot Sci. 2: 786-799). 

For polypeptides containing one or more Kunitz type domains as described herein the chemical ligation 
3 5 techniques described in U.S. Patent 5,403,737 for chemically synthesizing large biomolecules are especially 
useful. 
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Mutagenesis and Synthetic Techn^ 

Various techniques are available which be employed to produce mutant DNA, which can encode the 
polypeptides of the present invention. For instance, it is possible to derive mutant DNA based on naturally 
wxurring DNA sequences that encode for changes in an amino acid sequence of the resultant protein relative 
5 to a native Kunitz type domain such as the APPI molecule. These mutant DNA can be used to obtain the 
polypeptides of the present invention. 

By way of illustration, with expression vectors encoding APPI (Castro et al., supra ) or other naturally 
occurring Kunitz domain polypeptides in hand, site specific mutagenesis (Kunkel et al., (1991) Methods 
Enzymol. 204:125-139; Carter, P, et al., (1986) Nucl. Acids. Res. 13:4331; Zoller, M. J. et al, (1982) Nucl 
10 Acids Res. 10:6487), cassette mutagenesis (Wells, J. A., et al., (1985) Gene 34:315), restriction selection 
mutagenesis (Wells, J. A., et al, (1986) Philos. Trans, R. Soc. London Ser A 317, 415) or other known 
techniques may be performed on the DNA. The mutant DNA can then be used in place of the parent DNA by 
insertion into the appropriate expression vectors. Growth of host bacteria containing the expression vectors with 
the mutant DNA allows the production of Kunitzrtype serine protease inhibitor variants which can be isolated 
15 as described herein. 

For polypeptides containing one or more Kunitz type domains as described herein the Kunitz type 
domain can be ligated into a larger biomolecule using the chemical ligation techniques described in U.S. Patent 
5,403,737 for chemically synthesizing large biomolecules for example. 

Oligonucleotide-mediated mutagenesis is a preferred method for preparing Kunitz type variants of the 
2 0 present invention. This technique is well known in the art as described by Adelman et al., (1983) DNA. 2: 183. 
Briefly, the native or unaltered DNA of a native Kunitz type domain, for instance APPI, is altered by hybridizing 
an oligonucleotide encoding the desired mutation to a DNA template, where the template is the single-stranded 
form of a plasmid or bacteriophage containing the unaltered or native DNA sequence of APPI. After 
hybridization, a DNA polymerase is used to synthesize an entire second complementary strand of the template 
that will thus incorporate the oligonucleotide primer, and will code for the selected alteration in the Kunitz 
domain subunit DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used; An optimal oligonucleotide 
will have 12 to 15 nucleotides that are completely complementary to the template on either side of the 
nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the 
single-stranded DNA template molecule. The oligonucleotides are readily synthesized using techniques known 
in the art such as those described by Crea et al. (1987) Proc. Natl. Acad. Sci. USA, 75:5765. 

Single-stranded DNA template may also be generated by denaturing double-stranded plasmid (or other) 
DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid sequence variants, for example), the 

3 5 oligonucleotide is hybridized to the single-stranded template under suitable hybridization conditions. A DNA 

polymerizing enzyme, usually the Klenow fragment of DNA polymerase I, is then added, to synthesize the 
complementary strand of the template using the oligonucleotide as a primer for synthesis. A heteroduplex 
molecule is thus formed such that one strand of DNA encodes the mutated form of the Kunitz domain, and the 
other strand (the original template) encodes the native, unaltered sequence of the Kunitz domain. Hiis 

4 0 heteroduplex molecule is then transformed into a suitable host cell, usually a prokaryote such as £. coli 27C7. 
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After the cells are grown, they are plated onto agarose plates and screened using the oligonucleotide primer 
radiolabeled with 32-phosphate to identify the bacterial colonies that contain the mutated DNA. The mutated 
region is then removed and placed in an appropriate vector for protein production, generally an expression vector 
of the type typically employed for transfonnation of an appropriate host. 
5 • The method described immediately above may be modified such that a homoduplex molecule is created 

wherein both strands of the plasmid contain the mutation(s). TTie modifications are as follows: The 
single-stranded oligonucleotide is annealed to the single-stranded template as described above. A mixture of 
three deoxyribonucleotides, deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyribothymidine 
(dTTP), is combined with a modified thio-deoxyribocytosine called dCTP-(aS) (which can be obtained from 

10 Amersham Corporation). This mixture is added to the template-oligonucleotide complex. Upon addition of 
DNA polymerase to this mixture, a strand of DNA identical to the template except for the mutated bases is 
generated. In addition, this new strand of DNA will contain dCTP-(aS) instead of dCTP, which serves to protect 
it from restriction endonuclease digestion. 

After the template strand of the double-stranded heteroduplex is nicked with an appropriate restriction 

15 enzyme, the template strand can be digested with ExoIII nuclease or another appropriate nuclease past the region 
that contains the site(s) to be mutagenized. The reaction is then stopped to leave a molecule that is only partially 
single-stranded. A complete double-stranded DNA homoduplex is then formed using DNA polymerase in the 
presence of all four deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex molecule can 
then be transformed into a suitable host cell such as R coli 27C7, as described above. 

20 DNA encoding Kunitz domain variants with more than one amino acid to be substituted may be 

generated in one of several ways. If the amino acids are located close together in the polypeptide chain, they may 
be mutated simultaneously using one oligonucleotide that codes for all of the desired amino acid substitutions. 
If, however, the amino acids are located some distance from each other (separated by more than about ten amino 
acids), it is more difficult to generate a single oligonucleotide that encodes all of the desired changes. Instead, 

2 5 one of two alternative methods may be employed. 

In the first method, a separate oligonucleotide is generated for each amino acid to be substituted. The 
oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand 
of DNA that is synthesized from the template will encode all of the desired amino acid substitutions. 

The alternative method involves two or more rounds of mutagenesis to produce the desired mutant. The 

3 0 first round is as described for thcsingle mutants: wild-type DNA is used for the template, an oligonucleotide 

encoding the first desired amino acid substitution(s) is annealed to this template, and the heteroduplex DNA 
molecule is then generated. The second round of mutagenesis utilizes the mutated DNA produced in the first 
round of mutagenesis as the template. Thus, this template already contains one or more mutations. The 
oligonucleotide encoding the additional desired amino acid substitution(s) is then annealed to this template, and 

35 the resulting strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This 
resultant DNA can be used as a template in a third round of mutagenesis, and so on. 

A preferred vector for the recombinant expression of Kunitz-domain variants is pSAlzl. TOs vector, 
as described in Example 1 , contains origins of replication for E. coli, the alkaline phosphatase promoter, the stll 
signal sequence and an APPI variant gene, and the ampicillin resistance gene. Other preferred vectors are 

40 pB0475, pRITS and pRlT2T (Pharmacia Biotechnology). These vectors contain appropriate promoters 
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followed by the Z domain of protein A, allowing genes inserted into the vectors to be expressed as fusion 
proteins. Further discussion of these vectors may be found below. 

Other preferred vectors can be constructed using standard techniques by combining the relevant traits 
of the vectors described herein. Relevant traits of the vector include the promoter, the ribosome binding site, 
5 the APPI variant gene or gene fusion (the Z domain of protein A and APPI variant and its linker), the signal 
sequence, the antibiotic resistance markers, the copy number, and the appropriate origins of replication. 

In E. coli, Kunitz domains have been expressed as intact secreted proteins (Castro, M. et al., (1990) 
FEBS Lett. 267207-212), intracellularly expressed proteins (Altman, J. D. et al., (1991) Protein Eng. 4:593- 
600), or as fusion proteins (Sinha, S. et al., (1991) J. BioL Chem. 2662101 1-21013; Lauritzen, C. et al., (1991) 
10 Prot Express. Purif. 2:372-378; Auerswald, E. A. et al.,(1988) Biol. Chem. Hoppe-Seyler 369:27-35). 

The host cell may be prokaryotic or eukaryotic. Prokaryotes are preferred for cloning and expressing 
DNA sequences to produce parent polypeptides, segment substituted polypeptides, residue-substituted 
polypeptides and polypeptide variants. For example, £ coli K12 strain 294 (ATCC No. 3 1446) may be used 
as E, coli B, £. coli XI 776 (ATCC No. 3 1 537), and E coli c600 and c600hfl, E. coli W3 1 10 (F-, gamma-, 
15 prototrophic /ATCC No. 27325), bacilli such as Bacillus subtilis, and other enterobacteriaceae such as 
Salmonella -typhimurium or Serratia marcesans, and various pseudomonas species. The preferred prokaryote 
is E. coli W31 10 (ATCC 27325) as well as the non suppressor derivative 27C7 (ATCC 55,244). When 
expressed by prokaryotes the polypeptides typically contain an N-terminal methionine or a formyl methionine 
and are not glycosylated. In the case of fusion proteins, the N-terminal methionine or formyl methionine resides 
20 on the ammo terminus of the fusion protein or the signal sequence of the fusion protein. These examples are, 
of course, intended to be illustrative rather than limiting. 

In addition to prokaryotes, eukaryotic organisms, such as yeast cultures, or cells derived from 
multicellular organisms may be used. In principle, any such cell culture is workable. However, interest has been 
greatest in vertebrate cells, and propagation of vertebrate cells in culture (tissue culture) has become a 
25 reproducible procedure (Tj^e Culture, Kruse and Patterson, eds.(1973) Academic Press). Examples of such 
useful host cell lines are VERO and HeLa cells, Chinese Hamster Ovary (CHO) cell lines, W138, 293, BHK, 
COS-7 and MDCK cell lines. Yeast expression systems have been used to make Kunitz domains (Wagner, S. 
L. et al., (1992) Biochem. Biophys. Res. Commun. 186:1138-1145; Vedvick, T. et al., (1991) J. Indust. 
Microbiol. 7:197-202). In particular the yeast Pichia pastoris has been used successfully using the 
Saccharomyces cerevisiae a mating factor prepro signal sequence and the />. pastoris alcohol oxidase AOX1 
promoter and terminator sequences. Other yeast expression vectors and hosts commonly used to express 
heterologous proteins are also contemplated. 
Gene Feigns 

A variation on the above procedures contemplates the use of gene fusions, wherein the gene encoding 
35 the APPI variant is associated, in the vector, with a gene encoding another protein or a fragment of another 
protein. This results in the APPI variant being produced by the host cell as a fusion with another protein. Hie 
"other" protein is often a protein or peptide which can be secreted by the cell, making it possible to isolate and 
purify the desired protein from the culture medium and eliminating the necessity of destroying the host cells 
which arises when die desired protein remains inside the cell. Alternatively, the fusion protein can be express 
4 0 intracellularly. It is useful to use fusion proteins that are highly expressed. 
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The use of gene fusions; though not essential, can facilitate the expression of heterologous proteins in 
£. coli as well as the subsequent purification of those gene products (Harris, T. J. R. in Genetic Engineering , 
Williamson, R., eds.,(1983) Academic, London, Vol. 4, p. 127; Uhlen, M and Moks, T., (1990) Methods 
EnzymoL 185:129-143). Protein A fusions are often used because the binding of protein A, or more specifically 
5 the Z domain of protein A, to IgG provides an "affinity handle" for the purification of the fused protein (Nilsson, 
B. and Abrahmsen, L. (1990) Methods EnzymoL 185:144-161). It has also been shown that many heterologous 
proteins are degraded when expressed directly in £. coli, but are stable when expressed as fusion proteins 
(Marston, F. A. O., (1986) Biochem J., 240:1). 

APPI variants expressed as fusion proteins may be properly folded or may require folding to obtain the 

10 native structure. The properly folded fusion protein may be active and useful as a serine protease inhibitor. 
More preferred would be the correctly folded protein mat is obtained from the fusion protein by methods known 
in the art. Fusion proteins can be cleaved using chemicals, such as cyanogen bromide, which cleaves at a 
methionine, or hydroxylamine, which cleaves between an Asn and Gly. Using standard recombinant DNA 
methodology, the nucleotide base pairs encoding these amino acids may be inserted just prior to the 5* end of the 

15 APPI variant gene. 

Alternatively, one can employ proteolytic cleavage of fusion proteins, which has been recently reviewed 
(Carter, P. in Protein P urification; From Molecular Mechanisms to Large-Scale Processes Ladisch, M. R., 
Willson, R. C, Painton, C. C, and Builder, S. E., eds., (1990) American Chemical Society Symposium Series 
No. 427, Ch 13, pp. 181-193). 

2 0 Proteases such Factor Xa, thrombin, subtilisin and mutants thereof, have been successfully used to 

cleave fusion proteins. Typically, a peptide linker that is amenable to cleavage by the protease used is inserted 
between the "other" protein (e.g., the Z domain of protein A) and the protein of interest, such as an APPI variant. 
Using recombinant DNA methodology, the nucleotide base pairs encoding the linker are inserted between the 
genes or gene fragments coding for the other proteins. Proteolytic cleavage of the partially purified fusion 

2 5 protein containing the correct linker can then be carried out on either the native fusion protein, or the reduced 

or denatured fusion protein. 

The protein may or may not be properly folded when expressed as a fusion protein. Also, the specific 
peptide linker containing the cleavage site may or may not be accessible to the protease. These factors determine 
whether the fusion protein must be denatured and refolded, and if so, whether these procedures are employed 

3 0 before or after cleavage. 

When denaturing and refolding are needed, typically the protein is treated with a chaotrope, such as 
guanidine HC1, and is then treated with a redox buffer, containing, for example, reduced and oxidized 
dithiothreitol or glutathione at the appropriate ratios, pH, and temperature, such that the protein of interest is 
refolded to its native structure. 
35 Utility 

It has been suggested that the contact activation system plays a significant role in a variety of clinical 
states including septic shock, cardiopulmonary bypass surgery, adult respiratory distress syndrome, and 
hereditary angioedema (Bone, R.C., (1992), Arch. Intern. Med. 152: 138M389; Colman, R.W., (1989) N Engl. 
J. Med. 320:1207-1209). Inhibitors of the contact system may therefore play important roles in the regulation 
40 of inflammatory and/or thrombotic disorders. 
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The polypeptides described herein are useful in the treatment of diseases where intervention in the 
activation of the contact pathway or neutrophil activation is indicated (e.g. inflammation, coagulation, 
fibrinolysis, and complement activation). More specifically, the instant inhibitors are especially useful in the 
treatment of diseases where inhibition of plasma kallikrein (and FXIa) is indicated (see Figure 1) as for example 
5 in the treatment of sepsis or septic shock, inflammation, ARDS, DIC, hypotension, cardiopulmonary bypass 
surgery, and for bleeding from postoperative surgery as described in detail in the background section. 

The polypeptides described herein are suitably useful in clinical situations that require acute or chronic 
therapy. It is anticipated that indications for which acute therapy is indicated are more preferred than those for 
chronic therapy. The pharmaceutical use of foreign or mutant human proteins may be immunogenic; however 

10 foreign proteins are used to treat acute indications. An example of such a protein is streptokinase, a protein 
derived from streptococci that acts as a fibrinolytic and is commonly used to treat acute myocardial infarction. 
The agents described herein may elicit an immune response; however related foreign proteins such as BPTI have 
been used in humans clinically and are not anticipated to elicit a serious immune response. The covalent 
attachment of polyethylene glycol (PEG) to the agents described herein may reduce the immunogenicity and 

15 toxicity, and prolong the half-life as has been observed with other proteins (Katre N. V., (1990) J. Immunol. 
144:209-213; Poznansky, M. J. et al., (1988) FEB 239:18-22; Abuchowski, A. et al, (1977) J. Biol. Chem. 
252:3582-3586) 

Aprotinin inhibits the contact, neutrophil, and platelet activation systems during simulated 
extracorporeal perfusion as evidenced by a reduction in blood loss and kallikrein-Cl -inhibitor and Cl-Cl- 
20 inhibitor complexes, as well as prevention of neutrophil degranulation, platelet activation and aggregation 
(Westaby, S., (1993) Ann. Thor. Sfurg. 55: 1033-1041 ; Wachtfogel, Y., et al., (1993) J. Thorac. Cardiovasc. Surg. 
106:1-10). It has been used during LPS induced endotoxic shock in pigs and prevented arterial hypotension 
(Seibeck, M, et al., (1993) J. Trauma 34: 193-198). In patients with hepatic cirrhosis, aprotinin has resulted in 
improved renal function and filtration (MacGilchrist A„ (1994) Clin. Sci, 87:329-335). The plasma kallikrein 

2 5 Kunitz domain inhibitors described here may be suitably used in the treatment of these and related indications. 

The polypeptides of the present invention can be therapeutically useful in the modulation of functions 
mediated by plasma kallikrien just as aprotinin is used. The present polypeptides offer the advantage of 
increased potency and specificity for plasma kallikrein allowing for low dose formulations. Effective doses of 
the polypeptides of the present invention are determined according to the relevant techniques. The selection of 

3 0 compositions, frequency of administration, and amount of composition so administered will be in accordance 

with the particular disease being treated and its severity, the nature of the polypeptide employed, the overall 
condition of the patient, and the judgement of the treating physician. Topical dosing regimes will be analogous 
to treatment of these disease states by the use of other analogous proteins such as aprotinin. Typically, the 
compositions of the instant invention will contain from about 1% to about 95% of the active ingredient, 
3 5 preferably about 1 0% to about 50%. 

Preferably, the dosing will be by intravenous injection or short term infusion. To acheive optimal 
therapeutic effect, maintenance dosing may be required. Such maintenance dosing may be given repeatedly 
during the course of a day by, for instance repeated individual injections or by introduction into a continuous drip 
infusion. 

* 
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For intravenous injection or short term infusion, generally between about 1 and 1,000 mg will be 
administered to an adult and preferably between about 1 and 100 mg. Maintenance dosing at approximately 0.5 
to about 50 mg is anticipated. 

Other effective dosages can be readily determined by one of ordinary skill in the art through routine 
5 trials establishing dose response curves. 

Pharmaceutical compositions which comprise the polypeptides of the invention may be administered 
in any suitable manner, including parental, topical, oral, or local (such as aerosol or transdermal) or any 
combination thereof. The compositions are preferably administered with a pharmaceutically acceptable carrier, 
the nature of the carrier differing with the mode of administration, for example, in oral administration, usually 
10 using a solid carrier and in I.V. administration, a liquid salt solution carrier. 

The compositions of the present invention include pharmaceutically acceptable components that are 
compatible with the subject and the protein of the invention. These generally include suspensions, solutions and 
elixirs, and most especially biological buffers, such as phosphate buffered saline, saline, Dulbecco's Media, and 
the like. Aerosols may also be used, or carriers such as starches, sugars, microcrystalline cellulose, diluents, 
1 5 granulating agents, lubricants, binders, disintegrating agents, and the like (in the case of oral solid preparations, 
such as powders, capsules, and tablets). 

As used herein, the term "pharmaceutically acceptable" generally means approved by a regulatory 
agency of the Federal or a state government or listed in the U.S. Pharmacopeia or other generally recognized 
pharmacopeia for use in animals, and more particularly in humans. 
20 Ihe formulation of choice can be accomplished using a variety of the aforementioned buffers, or even 

excipients including, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, 
sodium saccharin cellulose, magnesium carbonate, and the like. "PEGylation" of the compositions may be 
achieved using techniques known to the art (see for example International Patent Publication No. W092/16555, 
U.S. Patent No. 5,122,614 to Enzon, and International Patent Publication No. WO92/00748). Oral compositions 

2 5 may be taken in the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations, or 

powders. 

The present invention has of necessity been discussed by reference to certain specific methods and 
materials. It is to be understood that the discussion of these specific methods and materials in no way constitutes 
any limitation on the scope of the present invention, which extends to any and all alternative materials and 

3 o methods suitable for accomplishing die ends of the present invention. 

Example 1 

Construction and Purification of Plasma Kallik rein Inhibitor* 

Methods 

Human Factor Vila, Factor Xa, Factor XIa, activated protein C, and thrombin were purchased from 
35 Haematologic Technologies Inc. (Essex Jet., VT). Human plasma kallikrein and Factor Xlla were purchased 
from Enzyme Research Laboratories, Inc. (South Bend, IN). The gene encoding the APPI sequence is described 
in Castro, M. et a]., (1990) FEBS Lett 267:207-212; recombinant human tissue factor U243 (TF) was produced 
in K coli as described in Paborsky, L.R., et aL, (1989) Biochemistry 28:8072-8077 and Paborsky, L.R., et al., 
(1991) J. Biol. Chem. 266:21911-21916. BPTI (Irasylol®) was obtained from Boehringer Mannheim 
40 (Indianapolis, IN). Bovine trypsin and TRITON® X-100 were purchased from Sigma Chemicals, Inc. Bovine 
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serum albumin (BSA), Fraction V was obtained from CaJbiochem (La Jolla, CA). Human plasm in, S-2302 S- 
2251, and S-2366 were purchased from Kabi Vitrum (Sweden) and Spectrozyme FXa was purchased from 
American Diagnostica (Greenwich, CT). R coli strain XL 1 -Blue was from Stratagene (La Jolla, CA). All other 
reagents obtained were of the highest grade commercially available. 
5 The plasmid pSAlzl was constructed by inserting a synthetic gene encoding the APPI sequence into 

an appropriate expression vector for secretion of APPI into the periplasm and media. The pSAlzl vector 
contained the alkaline phosphatase promoter, stll secretion signal, the APPI gene, the fl and colEl origins of 
replication, and the ampicillin resistance gene as described by Castro et al. (Castro, M. et al., (1990) supraV The 
construction of APPI mutants using the pSAlzl vector was accomplished using site-directed oligonucleotide 
10 mutagenesis as previously described (Kunkel, T. A. et aL, (1991) Methods Enzymol. 204:125-139); selected 
clones were analyzed by dideoxy sequence analysis (Sanger, F. et al., (1977) Proc. Natl. Acad. ScL USA 
74:5463-5467). 

Phagemids encoding eijher APPI or the selected mutants were transformed into E. coli strain 27C7, a 
derivative of K coli W3 1 1 0, for expression of the Kunitz domain inhibitors. Overnight saturated cultures were 

15 inoculated (1%) into 250 ml of low phosphate minimal media (Chang, C. N. et al., (1987) Gene 55:189-196) 
containing 50 ug/ml ampicillin and grown for 20 h at 37 °C. 

Inhibitors were secreted into the periplasm by virtue of the stll signal sequence and eventually leaked 
into the media. Cells and debris were removed by centrifugation (8000 x g, 1 0 min); the supernatant was adjusted 
to pH 7.5 - 8.5 with 1 M NaOH and men loaded onto a 1 ml trypsin-Affigel 1 0 (Bio-Rad Laboratories, Richmond, 

20 CA) affinity column which was prepared according to the manufacturer's recommendations. The column was 
washed with 100 mM Tris pH 8, 100 mM NaCl, and 20 mM CaCl 2 and inhibitors were eluted with 4 ml of 10 
mM HC1, 0.5 M KC1. The inhibitors were further purified using CI 8 reverse phase HPLC (250 x 4.6 mm, 
VYDAC); they were loaded in 0.1% trifluoroacetic acid and eluted with a CH3CN gradient from 5 to 40 % at 
1 ml/min. Ehition profiles were monitored at both A 214 and A 280 . A single well resolved peak was detected 

25 for each inhibitor between 30 to 35 % CH3CN. Inhibitor sequences were verified for the proper mass using a 
Sciex API 3 mass spectrometer equipped with an articulated electrospray source for mass analysis. Multiply 
charged ions of horse myoglobin" (MW = 16,951 Da) were used for instrument calibration. 



Site^lirected mutants were made to investigate the preferred amino acids at positions 1 1-19 and 34 of 
30 a Kunitz domain for the potent inhibition of plasma kallikrein. The site directed mutants of APPI have the 
general formula: 

R 1 -Xaa 5 -Xaa 4 -Xaa 3 -Xaa 2 -Xaa 1 -Xaa 1 • -Xaa 2 • -Xaa 3 * -Xaa 4 • -R 2 -Xaa 19 • -R 3 
wherein K } represents amino acid residues 1-10 of APPI, VREVCSEQAE (SEQ ID NO: 6); 1^ represents amino 
acid residues 20-33 of APPI, RWYFDVTEGKCAPF (SEQ ID NO: 14); and R3 represents amino acid residues 
35 35 through 58 of APPI, YGGCGGNRNNFDTEEYCAAVCGSA (SEQ ID NO: 23XSEQ ID NO: 23). The APPI 
sequence used in this example has an additional mutation of residue 52 so that Met in the native sequence is 
replaced with Ala. This mutation is not believed to have any effect on inhibitory activity. 
The non-naturally occurring Kunitz domains obtained are listed in Table I below. 
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Table I 

Sequences of Kunitz Domain Mm-ani-. 



Inhibitor 








KALI-D 


R 1 DGHCRAAHPR 2 FR3 


(SEQ 


ID NO: 


45) 


KALI-DV 


R 1 DGHCRAAHPR 2 VR3 


(SEQ 


ID NO: 


46) 


KALI-DY 


R 1 DGHCRAAHPR 2 YR 3 


(SEQ 


ID NO: 


47) 


KALI-P 


Ri PGHCRAAHPRo PR-> 


(SEQ ID NO: 


48) 


KALI-PV 


p„ PGHPRAAHPP^VP-. 
n 1 run wiuwtnr a n vxso 


(SEQ 


ID NO: 


49) 


KALI-PY 


R 1 PGHCRAAHPR 2 YR3 


(SEQ 


ID NO: 


50) 


KALI-E 


Ri EGHCRAAHPRi FR i 


(SEQ 


ID NO: 


51) 


KALI-EV 


p- EGHCRAAHPRoVR-a 


(SEQ 


ID NO: 


52) 


KALI-EY 




(SEQ 


ID NO: 


53) 


KALI-13 


R-i SGHCRAAIPRoPR-* 


(SEQ 


ID NO: 


1) 


KALI-48 


i VaXlUK>4/iJ.FK2 FK3 


(SEQ ID NO: 


2) 


KALI -8 


R 1 LGHCRAAIPR 2 FR 3 


(SEQ 


ID NO: 


3) 


KALI- 10 


R 1 DGPCRAAIPR 2 FR 3 


(SEQ ID NO: 


4) 


KALI-30 


R 1 EGHCRAAILR 2 FR 3 


(SEQ 


ID NO: 


5) 


KALI-46 


R 1 EGRCRASILR 2 FR 3 


(SEQ 


ID NO: 


33) 


KALI-38 


R 1 TGPCRALHSR 2 YR 3 


(SEQ 


ID NO: 


43) 


KALI -42 


R 1 TGPCRAAHSR 2 VR 3 


(SEQ 


ID NO: 


44) 



The Kunitz domain variants listed in Table I generally led to potent and selective inhibition of plasma 
kallikrein (see Examples 2 and 3). 

Other sequences which inhibit plasma kallikrein are described in Table II below. 
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Table n 



Inhibitor 
Kali- 19 
Kali- 12 

5 Kali-26 

Kali-31 

Kali-22 

Kali-25 

Kali-7 

10 Kali-29 
Kali-23 
Kali-21 
Kali-17 
Kali-28 

15 Kali-32 
Kali-35 
Kali-39 
Kali-41 
Kali-33 

20 Kali-36 



R j DGHCRAAIPR2FR3 

r 1 dgpcraaipr 2 fr 3 

R x DGPCRAAIPR 2 lR 3 

R 1 DGRCRAAIPR 2 FR 3 

R 1 EGTCRANIYR 2 FR 3 

R 1 LGGCRAWILR 2 FR 3 

R 1 PGHCRAAIPR 2 FR 3 

R 1 PGLCRAAFPR 2 FR 3 

R 1 PGLCRAAIYR 2 FR 3 

R 1 PGLCRALIWR 2 FR 3 

R 1 PGRCRAAIPR 2 FR 3 

R 1 RGHCRAAIPR 2 FR 3 

R 1 TGPCRAAHSR 2 VR 3 

R 1 TGPCRAAHSR 2 YR 3 

R 1 TGPCRGAHSR 2 VR 3 

R 1 TGPCRGAHSR 2 WR 3 

r 1 tgpcralhsr 2 yr 3 
r 1 tgpcranhsr 2 sr 3 



(SEQ. ID NO: 54) 
(SEQ.IDNO: 55) 

(SEQ. ID NO: 56) 
(SEQ. ID NO: 57) 
(SEQ.IDNO: 58) 
(SEQ. ID NO: 59) 
(SEQ. ID NO: 60) 
(SEQ. ID NO: 61) 
(SEQ.IDNO: 62) 
(SEQ.IDNO: 63) 
(SEQ. ID NO: 64) 
(SEQ.IDNO: 65) 
(SEQ. ID NO: 66) 
(SEQ. ID NO: 67) 
(SEQ. ID NO: 68) 
(SEQ.IDNO: 69) 
(SEQ. ID NO: 70) 
(SEQ. ID NO: 71) 



25 



30 



35 



Methods 



Example 2 

Determination of Equilibrium Dissoci ation Constants 



in 



Apparent equilibrium dissociation constants (Kj*) were determined for the Kunitz-domain mutants listed 
Table I. The K^ values were determined using methods derived for tight-binding inhibitors (Bieth, J. (1 974) 
in Proteinase fchfritQrs (Fritz, H., Tschesche, H., Greene, L.J., and Truscheit, E., eds), pp. 463-469, Springer- 
Verlag, New York; Williams, J. W., and Morrison, J.F., (1979) supra) , assuming enzyme and inhibitor form a 
reversible complex with a 1:1 stoichiometry as has been observed for the interaction of Kunitz domains with 
serine proteases (Bode, W., and Huber, R., (1992) sm&l Laskowski, M.,Jr„ and Kato, I., (1980) supra . 

Concentrations of the inhibitor stocks were accurately determined by titration with active site-titrated 
trypsin (Jameson, G.W.,etal, (1973) Biochem. J., 131:107-117). After l h incubation of 80 hM trypsin plus 
an aliquot of diluted inhibitor in 50 tnM Tris (pH 8.0), 100 mM NaCl, 10 mM CaC^ and 0.05% Triton X-100 
at room temperature, 20 ul of 5mM N a -benzoyl-L-arginine-/7-nitroanilide was added to a total volume of 150 
Ml. The change in absorbance at 405 nM was then monitored The concentrations determined assumed a 1 : 1 
stoichiometry of inhibitor with trypsin. 
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The inhibition of plasma kallikrein by selected APPI mutants (Table I) was determined at 25 °C in 50 
mM Tris, pH 7.5, 100 mM NaCl, 2 mM CaCl 2 and 0.005% Triton X-100 using an aliquot of the same diluted 
inhibitor solutions. Reactions (200 jiL) were carried out in microtiter plates and the rate substrate (0.7 mM 
S2302) was monitored at 405 nm following a 1.5 h incubation. Plots of the fractional rate versus inhibitor 
5 concentration were fit by nonlinear regression analysis to Equation 1 to determine apparent equilibrium 
dissociation constants (Kj*) : ' 
Equation 1: 

v/v - tEJ +[I 0 ] *-J ( [E 0 ] +[I 0 ] *)*-(4* Tij - [IjJ (1) 
1 0 . 2*[EJ 

where V/V Q is the fractional activity (steady-state inhibited rate divided by the uninhibited rate), [E 0 ] is the total 
1 0 plasma kallikrein active site concentration, and [I 0 ] is the total inhibitor concentration. 
Results 

The Kunitz domain inhibitors described in Table I had apparent dissociation constants in the range of 
10 to 300 pM (Table III and Table IV). 

The consensus sequence of Table III below consisting of Pro, Asp or Glu at position 1 1 (P^), His at 
15 position 13 (P 3 ), Aig at position 15 (Pj), Ala at positions 16 (Pj 1 ) and 17 (P 2 '), His at position 18 (P 3 *), Pro at 
position 19 (P 4 f ) and Val or Tyr at position 34 (P 19 ') of APPI was developed based on several observations from 
the Table II polypeptides, as well as the results presented in Table IV. Variants of APPI with Arg at position 
15 (Pj) and His, Ala, His and Pro at positions 13, 17, 18 and 19 respectively, were found to potently inhibit 
plasma kallikrein. In these variants, a preference for Pro, Asp or Glu was observed at position 1 1 (P 5 ), and at 
2 0 position 34 (P^O Val or Tyr were preferred The cysteine at position 14 (P 2 ) which forms a disulfide bond with 
the cysteine at position 38 (P23) of Kunitz type serine protease inhibitors remained unaltered. Since Gly is 
almost always found at position 12 (P 4 ) in Kunitz type domains, it was not varied. 

In particular, of the consensus type Kunitz domains, mutants containing an Asp at position 1 1 had 
apparent equilibrium dissociation constants below 50 pM. One such mutant, KALI-DY, inhibited plasma 
25 kallikrein over 10,000-fold better than APPI, and 3,000-fold more potently than BPTI (aprotinin, Trasylol®). 
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Example 3 
Specificity Assays 

Method 

Assays to test the specificity of APPI, BPTT, and Kunitz domain variants against other serine proteases 
5 involved in coagulation were conducted using the following format. Aliquots (30 uL) of various inhibitors 
diluted to 500 nM were incubated with each protease (100 uL) in the appropriate buffer. After incubation of the 
substrate/irihfoitor mixes at room temperature for 2 h f the appropriate substrate (20 uL) was added, and the 
absorbance at 405 nm was monitored. Controls lacking inhibitor and enzyme were assayed to measure the 
uninhibited and substrate hydrolysis rates, respectively. The enzymes and substrates were screened in 50 mM 

10 Tris, pH 7.5, 100 mM NaCl, 2 mM CaCl 2 and 0.005% Triton X-100 as follows: thrombin (6.2 nM), 0.7 mM 
S2366; FXa (2.5 nM), 0.7 mM Spectrozyme fXa; FXIa (1.8 nM), 0.7 mM S2366; activated protein C (7.6 nM), 
0.7 mM S2366; plasmin (32 nM), 0.7 mM S2251; Factor Xlla (14 nM), 0.7 mM S2302, TF (77 nM)-FVUa (14 
nM), 0.7 mM S2366 and plasma kallikrein (3.5 nM), 0.7 mM S2302. Hie FXIa assay also contained 1 mg/mL 
BSA. The concentration of FXIa throughout refers to the concentration of active sites. For this experiment, the 

15 concentrations of thrombin, TF-FVIIa, FXa, FXIa, FXIIa, and plasmin are approximate and are determined based 
upon the manufacturers* specifications. 
Results 

The inhibition of other relevant serine proteases found in human plasma was also measured to determine the 
relative specificity of the Kunitz domain inhibitors described in Example 2. Serine proteases (1 to 20 nM) were 
20 assayed in the presence of 100 nM inhibitor. The traction of remaining proteolytic activity is reported in Table 
IV. All of the selected Kunitz domains mutants inhibited FXIa, whereas none of mem appreciably inhibited 
FXIIa, FXa, thrombin, TF-FVIIa, or activated protein C (Table IV). Most of the selected Kunitz inhibitors 
inhibited plasmin only slightly and moderate (>60%) inhibition was observed for KALI-38, KALI-42 and KALI- 
48; however, the consensus mutants of Example 2 (Table III) did not appreciably inhibit plasmin. The degree 
to which FXIa was inhibited by selected inhibitors including the consensus mutants was further investigated by 
measuring the iq*. The inhibition of plasma kallikrein (0.5 nM) in the presence of APPI, BPTI, KALI-D, and 
KALI-DY is shown in Figure 4A. The inhibition of FXIa (3.5 nM) by APPI, BPTI, KALI- 10 and KALI-DY 
is shown in Figure 4B, and Kj* values are reported in Tables III and IV. 



25 
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Example 4 
Coagulation A ?f?aY 8 

Methods 

* 

Clotting times for normal human plasma were performed using an MLA Electra 800 Coagulometer 
5 (Medical Laboratory Automation, Inc. Pleasantville, N.Y.) and Dade reagents (Baxter Health Care Corp., Miami 
FL). The clotting time was determined by optical assessment 

For the activated partial thromboplastin time (APTT) assays, the activation time was set at 120 sec and 
acquisition time at 300 to 600 sec depending on the expected outcome of the assay. Citrated normal human 
plasma and inhibitor were incubated together. The sample (plasma and inhibitor) and activator (Actin FS) were 
10 automatically pipetted and incubated together for 2 min at 37 °C, then CaCl 2 was added and clotting time 
determined by means of optical assessment. The total incubation time of inhibitor with plasma was ca. 3 min 
before addition of activator, and 5 min before addition of CaCl 2 . 
Results 

As predicted from the specificity assays, KALI-DY prolonged the clotting time in the APTT, a measure 
of the intrinsic coagulation pathway, but not the PT, a measure of the extrinsic coagulation pathway (Figure 5 A). 
KALI-DY was significantly more effective than either APPI, a FXIa inhibitor (Kj* = 2.7 nM) or BPTI, a 
kallikrein inhibitor (Kj - 45 nM)/ In addition, although plasma prekallikrein is present at 600 nM in plasma, 
250 nM KALI-DY can prolong clotting by 2-fold 

KALI-DY prolonged the clotting time of the surface mediated contact activation pathway in a 
concentration dependent manner as measured by the APTT. A greater than 3.5-fold prolongation of the clotting 
time at 1 uM was observed with KALI-DY compared with a 2.8 and 1.8-fold prolongation observed with APPI 
and BPTI, respectively (Fig. 5A). In contrast, neither KALI-DY, BPTI, nor APPI appreciably prolonged the 
clotting time in a tissue factor initiated PT assay (Figure 5B). 

All references cited herein are expressly incorporated by reference. 



15 



20 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc. 
(ii) TITLE OF INVENTION: KUNITZ TYPE PLASMA KALLIKREIN INHIBITORS 
5 (iii) NUMBER OF SEQUENCES: 72 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South* San Francisco 
10 (D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 
15 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
20 (B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Kubinec, Jeffrey S. 

(B) REGISTRATION NUMBER: 36,575 

25 (C) REFERENCE/DOCKET NUMBER: P0944PCT 

i 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/225-8228 

(B) TELEFAX: 415/952-9881 

(C) TELEX* 910/371-7168 

30 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Ser Gly His Cys Arg 
1 5 10 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 



40 Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 



20 



25 



30 



35 



40 



45 



Asp Thr Glu Glu Tyr 



Cys Ala Ala Val Cys Gly Ser Ala 
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50 55 58 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
> (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly His Cys Arg 
1 5 in 



10 



15 



Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
15 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
20 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Leu Gly His Cys Arg 
15 10 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
25 20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 4 5 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 , 55 58 

30 (2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly Pro Cys Arg 
1 5 io * 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 
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Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

5 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY; Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Glu Gly His Cys Arg 
1 5 10 15 

Ala Ala He Leu Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

15 Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 

35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 6: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

25 Val Arg Glu Val Cys Ser Glu Gin Ala Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 10 amino acids 
30 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met His Ser Phe Cys Ala Phe Lys Ala Asp 
15 10 

35 (2) INFORMATION FOR SEQ, ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lys Pro Asp Phe Cys Phe Leu Glu Glu Asp 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

10 Gly Pro Ser Trp Cys Leu Thr Pro Ala Asp 
1 5 10 ' 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 amino acids 
!5 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Glu Asp Ser Cys Gin Leu Gly Tyr Ser 
1. 5 io 

20 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Thr Val Ala Ala Cys Asn Leu Pro He Val 
1 5 io 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Leu Pro Asn Val Cys Ala Phe Pro Met Glu 
35 1 5 ' io 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
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(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Arg Pro Asp Phe Cys Leu Glu Pro Pro Tyr 
1 5 10 

5 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino, Acid 
(D) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe 
1 5 10 14 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 14 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Arg Phe Phe Phe Asn He Phe Thr Arg Gin Cys Glu Glu Phe 
20 1 5 10 14 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino Acid 
25 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Arg Tyr Phe Tyr Asn Asn Gin Thr Lys Gin Cys Glu Arg Phe 
1 5 10 14 

(2) INFORMATION FOR SEQ ID NO: 17: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

35 Arg Phe Tyr Tyr Asn Ser Val He Gly Lys Cys Arg Pro Phe 
15 10 14 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 14 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

5 Arg Tyr Phe Tyr Asn Gly Thr Ser Met Ala Cys Glu Thr Phe 
15 10 14 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 amino acids 
10 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 19: 

Leu Trp Ala Phe Asp Ala Val Lys Gly Lys Cys Val Leu Phe 
15 10 14 

15 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Lys Trp Tyr Tyr Asp Pro Asn Thr Lys Ser Cys Ala Arg Phe 
1 5 10 14 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 14 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Arg Trp Phe Phe Asn Phe Glu Thr Gly Glu Cys Glu Leu Phe 
30 1 5 10 14 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Arg Tyr Phe Tyr Asn Ala Lys Ala Gly Leu Cys Gin Thr Phe 
15 10 14 

(2) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu 
15 10 15 

Tyr Cys Ala Ala Val Cys Gly Ser Ala 
20 24 

10 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu 
15 10 15 

Tyr Cys Met Ala Val Cys Gly Ser Ala 
20 24 

20 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Tyr Gly Gly Cys Glu Gly Asn Gin Asn Arg Phe Glu Ser Leu Glu 
1 5 10 15 

Glu Cys Lys Lys Met cys Thr Arg Asp 
20 24 

30 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Tyr Gly Gly Cys Leu Gly Asn Met Asn Asn Phe Glu Thr Leu Glu 
15 10 15 

Glu Cys Lys Asn He Cys Glu Asp Gly 
20 24 
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(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino Acid 
5 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Tyr Ser Gly Cys Gly Gly Asn Glu Asn Asn Phe Thr Ser Lys Gin 
1 5 10 d 15 

Glu Cys Leu Arg Ala Cys Lys Lys Gly 
10 20 . 24 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino ( Acid 
15 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Tyr Gly Gly Cys Met Gly Asn Gly Asn Asn Phe Val Thr Glu Lys 
1 5 10 15 

Glu Cys Leu Gin Thr Cys Arg Thr Val 
20 20 ~ ~ 24 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino Acid 
25 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Tyr Gly Gly Cys Gin Gly Asn Gly Asn Lys Phe Tyr Ser Glu Lys 
15 10 15 

Glu Cys Arg Glu Tyr Cys Gly Val Pro 
30 20 24 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Tyr Gly Gly Cys Gly Gly Asn Glu Asn Lys Phe Gly Ser Gin Lys 
15 10 is 

Glu Cys Glu Lys Val Cys Ala Pro Val 
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20 24 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 24 amino. acids 
5 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 31: 

Tyr Gly Gly Cys Gly Gly Asn Ser Asn Asn Phe Leu Arg Lys Glu 
1 5 10 15 

10 Lys Cys Glu Lys Phe Cys Lys Phe Thr 

20 24 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 amino acids 
!5 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Tyr Gly Gly Cys Arg Ala Lys Arg Asn Asn Phe Lys Ser Ala Glu 
1 5 10 15 

20 Asp Cys Met Arg Thr Cys Gly Gly Ala 

20 24 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Glu Gly Arg Cys Arg 
1 5 10 ~ ~ 15 

30 Ala Ser lie Leu Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 

20 2 5 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly. Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
35 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
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(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
15 10 * 15 

5 Ala Met He Ser Arg Xrp Tyr Phe Asp Val Thr Glu Gly Lys Cys 

20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Met Ala Val Cys Gly Ser Ala 
10 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 35: 

* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
15 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Lys 
1 5 10 15 

Ala He Met Lys Arg Phe Phe Phe Asn He Phe Thr Arg Gin Cys 
20 20 25 ~ 30 

Glu Glu Phe He Tyr Gly Gly Cys Glu Gly Asn Gin Asn Arg Phe 
35 40 ~ 45 

Glu Ser Leu Glu Glu Cys Lys Lys Met Cys Thr Arg Asp 
50 55 58 

25 (2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Lys Pro Asp Phe Cys Phe Leu Glu Glu Asp Pro Gly He Cys Arg 
1 5 10 15 

Gly Tyr He Thr Arg Tyr Phe Tyr Asn Asn Gin Thr Lys Gin Cys 
20 25 30 

35 Glu Arg Phe Lys Tyr Gly Gly Cys Leu Gly Asn Met Asn Asn Phe 

35 40 45 

Glu Thr Leu Glu Glu -Cys Lys Asn He Cys Glu Asp Gly 
50 55 58 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
5 (D) TOPOLOGY: Jiinear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Gly Pro Ser Trp Cys Leu Thr Pro Ala Asp Arg Gly Leu Cys Arg 
1 5 10 15 

Ala Asn Glu Asn Arg Phe Tyr Tyr Asn Ser Val lie Gly Lys Cys 
« 20 25 * 30 

Arg Pro Phe Lys Tyr Ser Gly Cys Gly Gly Asn Glu Asn Asn Phe 
35 40 45 

Thr Ser Lys Gin Glu Cys Leu Arg Ala Cys Lys Lys Gly 
50 55 58 

15 (2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino. Acid 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Lys Glu Asp Ser Cys Gin Leu Gly Tyr Ser Ala Gly Pro Cys Met 
1 5 10 15 

Gly Met Thr Ser Arg Tyr Phe Tyr Asn Gly Thr Ser Met Ala Cys 
20 25 30 

25 Glu Thr Phe Gin Tyr Gly Gly Cys Met Gly Asn Gly Asn Asn Phe 

35 40 45 

Val Thr Glu Lys Glu Cys Leu Gin Thr Cys Arg Thr Val 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 39: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

35 Thr Val Ala Ala Cys Asn Leu Pro He Val Arg Gly Pro Cys Arg 
1 5 10 ~ is 

Ala Phe He Gin Leu Trp Ala Phe Asp Ala Val Lys Gly Lys Cys 
20 . 25 30 

Val Leu Phe Pro Tyr Gly Gly Cys Gin Gly Asn Gly Asn Lys Phe 
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35 40 45 

Tyr Ser Glu Lys Glu Cys Arg Glu Tyr Cys Gly Val Pro 
50 55 58 

(2) INFORMATION FOR S±Q ID NO:40: 

> (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Glu Thr Asp lie Cys Lys Leu Pro Lys Asp Glu Gly Thr Cys Arg 
1 5 10 15 

Asp Phe He Leu Lys Trp Tyr Tyr Asp Pro Asn Thr Lys Ser Cys 
20 25 ~ 30 

Ala Arg Phe Trp Tyr Gly Gly Cys Gly Gly Asn Glu Asn Lys Phe 
15 35 40 * 45 

Gly Ser Gin Lys Glu Cys Glu Lys Val Cys Ala Pro Val 
50 f 55 58 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
<D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Leu Pro Asn Val Cys Ala Phe Pro Met Glu Lys Gly Pro Cys Gin 
25 1 5 io 15 

Thr Tyr Met Thr Arg Trp Phe Phe Asn Phe Glu Thr Gly Glu Cys 
20 25 " 30 

Glu Leu Phe Ala Tyr Gly Gly Cys Gly Gly Asn Ser Asn Asn Phe 
35 40 45 

30 Leu Arg Lys Glu Lys Cys Glu Lys Phe Cys Lys Phe Thr 

50 55 58 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
35 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Arg Pro Asp Phe Cys Leu Glu Pro Pro Tyr Thr Gly Pro Cys Lys 
1 5 10 * ' 15 
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Ala Arg lie He Arg Tyr Phe Tyr Asn Ala Lys Ala Gly Leu Cys 
20 25 20 

Gin Thr Phe Val Tyr Gly Gly Cys Arg Ala Lys Arg Asn Asn Phe 
35 • 40 45 

5 Lys Ser Ala Glu Asp Cys Met Arg Thr Cys Gly Gly Ala 

50 55 58 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
10 (B) TYPE:. Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
1 5 10 * 15 

15 Ala Leu His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 

20 25 30 

Ala Pro Phe Tyr Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
20 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
25 (D) TOPOLOGY: Linear 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
1 5 10 is 

Ala Ala His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
30 20 25 * 30 

Ala Pro Phe Val Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

35 (2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly His Cys Arg 
1 5 4 10 15 

Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
5 20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

10 (2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly His Cys Arg 
1 5 10 15 

Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 * 30 

20 Ala Pro Phe Val Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 

35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 , 55 58 

(2) INFORMATION FOR SEQ ID NO:47: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

30 Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly His Cys Arg 
1 5 io is 

Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 * * 30 

Ala Pro Phe Tyr Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 48: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly His Cys Arg 
1 5 10 4 15 

Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 49: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



10 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly His Cys Arg 
1 5 10 ^ 15 

Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 , 25 * 30 

Ala Pro Phe Val Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
25 35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50. 55 58 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 58 amino acids 

<B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly His Cys Arg 
35 1 5 10 is 

Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 " 30 

Ala Pro Phe Tyr Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 
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Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 . 55 58 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Glu Gly His Cys Arg 
10 1 5 10 * * 15 

Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

15 Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 

50 55 58 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
20 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Glu Gly His Cys Arg 
1 5 10 15 

25 Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 

20 25 ^ J 30 

Ala Pro Phe Val Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
30 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Glu Gly His Cys Arg 
1 5 10 15 

. Ala Ala His Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
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20 ' 25 30 

Ala Pro Phe Tyr Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
5 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
10 (D) TOPOLOGY: Linear 

<(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly His Cys Arg 
1 5 10 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
15 20 25 " 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

20 (2) INFORMATION FOR SE£ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly Pro Cys Arg 
1 5 , 10 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

30 Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 

35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 56: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

-49- 



W09639519 f file:/A\dcwasQ3\firmdata\lp\FolevPat\PatentPocuments\W09639519.CPC l Pag e 52 of 71 

• ■ 

WO 96/39519 \ PCT/US96/09059 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly Pro Cys Arg 
1 5 10 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

5 Ala Pro Phe He Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 

« 40 . 45 

Asp Thr Glu Glu Tyr dys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 57: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

15 Val Arg Glu Val Cys Ser Glu Gin Ala Glu Asp Gly Arg Cys Arg 
1 5 10 " 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 ' 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
20 35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 58 amino acids 

<B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Glu Gly Thr Cys Ara 
30 1 5 10 !5 

Ala Asn He Tyr Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 " 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly A$n Arg Asn Asn Phe 
35 40 45 

35 Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 

50 55 ' 58 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 

i 
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(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Leu Gly Gly Cys Arg 
5 1 5 10 15 

Ala Trp He Leu Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

10 Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 

50 55 58 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
15 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly His Cys Arg 
1 5 10 ' 15 

20 Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 

20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
25 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
30 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly Leu Cys Arg 
15 10 15 

Ala Ala Phe Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
35 20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
5 (D) TOPOLOGY: Linear 

i t 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly Leu Cys Arg 
1 5 10 15 

Ala Ala lie Tyr Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
10 20 25 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 « 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

15 (2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly Leu Cys Arg 
1 5 . 10 * 15 

Ala Leu He Trp Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 * 30 

25 Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 

35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 64: 

30 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

35 Val Arg Glu Val Cys Ser Glu Gin Ala Glu Pro Gly Arg Cys Arg 
1 5 io 15 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20- 25 * 30 

. Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
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35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION POR SEQ ID NO: 65: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

10 Val Arg Glu Val Cys Ser Glu Gin Ala Glu Arg Gly His Cys Arg 
1 5 . 10 is 

Ala Ala He Pro Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 ^ 30 

Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
15 35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
25 1 5 10 15 

Ala Ala His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 ' 30 

Ala Pro Phe Val Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 , 45 

30 Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 

50 55 58 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
35 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

* 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
15 10 15 
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Ala Ala His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

Ala Pro Phe Tyr Tyr Gly Gly Cys Gly Gly Asn. Arg Asn Asn Phe 
35 40 . 45 

5 Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 

50 55 58 

(2) INFORMATION FOR SEQ ID N0:68: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 amino acids 
10 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
1 5 10 15 

15 Gly Ala His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 

20 25 30 

Ala Pro Phe Val Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 * 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
20 50 55 58 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
25 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
1 5 in 



15 



Gly Ala His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
30 20 25 30 

Ala Pro Phe Trp Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

35 (2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg 
1 5 . 10 ' ' 15 

■ Ala Leu His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
5 20 25 ' 30 

Ala Pro Phe Tyr Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 
35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 55 58 

10 (2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Val Arg Glu Val Cys S^r Glu Gin Ala Glu Thr Gly Pro Cys Arg 
1 5 10 " 15 

Ala Asn His Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
20 25 30 

20 Ala Pro Phe Ser Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe 

35 40 45 

Asp Thr Glu Glu Tyr Cys Ala Ala Val Cys Gly Ser Ala 
50 , 55 58 

(2) INFORMATION FOR SEQ ID NO: 72: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

30 Asp Gly His Cys Arg Ala Ala His Pro 
1 5 9 
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WHAT IS CLAIMED IS: 

1. A polypeptide comprising a non-native Kunitz-type serine protease inhibitor domain said 
Kunitz-type serine protease inhibitor domain having a primary binding loop: 
Xaa 5 -Xaa 4 -Xaa 3 -Xaa 2 -Xaa 1 -Xaa j l -Xaa 2 , .Xaa 3 , -Xaa 4 ' 
5 and a secondary binding loop comprising: 
Xaaj 9 f 

wherein; 

Xaa 5 is selected from the group consisting of P, D, E, S, T, R, and L; 
Xaa 4 is G; 

10 Xaa 3 is selected from the group consisting of H, P, R, L, G, and T; 
Xaa2 is C; 
Xaaj isR; 

Xaaj' is selected from the group consisting of A and G; 

Xaa^ is selected from the group consisting of A, L, N, W, and S; 
15 Xaa 3 * is selected from the group consisting of H and I; 

Xaa 4 ' is selected from the group consisting of P, Y, L, and W; 

and Xaa 19 ! is selected from V, Y, W, S, and F; 

said polypeptide being capable of inhibiting plasma kallikrein. 
2. The polypeptide of claim 1 wherein 
2 0 Xaa 5 is selected from the group consisting of P, D, and E; 

Xaa 4 is G; 

Xaa 3 is selected from the group consisting of H, P, R, and L; 
Xaa2 isC; 
Xaa j is R; 

2 5 Xaaj 1 is selected from the group consisting of A and G; 

Xaa^ is selected from the group consisting of A, L, N, and W; 

Xaa 3 * is selected from the group consisting of H and I; 

Xaa 4 f is selected from the group consisting of P, Y, L, and W; 

and Xaajp 1 is selected from V, Y, W, and S. 
30 3. The polypeptide according to claim 2 wherein the sequence Xaa 5 -Xaa 4 -Xaa 3 -Xaa 2 -Xaa ] - 

Xaaf-Xaaj'-Xaa^-Xaa^ is DGHCRAAHP (SEQ ID NO: 72) and 

and Xaa j 9 f is selected from V, W and Y. 

4. The polypeptide according to claim 1 wherein the Kunitz-type serine protease inhibitor domain 
has the sequence: 
35 R r Xaa 5 -Xaa 4 -Xaa 3 -Xaa2-^ 

where Rj is a 10 amino acid peptide wherein the amino acid corresponding to amino acid position Xaaj j is a 
C; 
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R2 is a 14 amino acid peptide wherein the amino acid corresponding to amino acid position Xaa 15 ' is a C; and 
R 3 is a 24 amino acid peptide wherein the amino acid corresponding to amino acid position Xaa 2 3 , > Xaa 36 t and 
Xaa 4 Q' are a C. 

5. The polypeptide according to claim 4 where Rj is selected from the group consisting of 
5 VREVCSEQAE (SEQ ID NO: 6), 

MHSFCAFKAD (SEQ ID NO: 7), 

KPDFCFLEED (SEQ ID NO: 8), 

GPSWCLTPAb (SEQ ID NO: 9), 

KEDSCQLGYS (SEQ ID NO: 10), 
10 TVAACNLPIV (SEQ ID NO: 1 1), 

LPNVCAFPME (SEQ ID NO: 12), and 

RPDFCLEPPY (SEQ ID NO: 13); 
R2 is selected from the group consisting of 

RWYFDVTEGKCAPF (SEQ ID NO: 14), 
15 RFFFNIFTRQCEEF (SEQ ID NO: 15), 

RYFYNNQTKQCERF (SEQ ID NO: 16X 

RFYYNSVIGKCRPF (SEQ ID NO: 17), 

RYFYNGTSMACETF (SEQ ID NO: 18), 

LWAFDA VKGKCVLF (SEQ ID NO: 19), 
2 0 KWYYDPNTKSCARF (SEQ ID NO: 20), 

RWFFNFETG^CELF (SEQ ID NO: 21), and 

RYFYNAKAGLCQTF (SEQ ID NO: 22); and 
R3 is selected from the group consisting of 

YGGCGGNRNNFDTEEYCAAVCGSA (SEQ ID NO: 23), 

2 5 YGGCGGNRNNFDTEEYCMAVCGSA (SEQ ID NO: 24), 

YGGCEGNQNRFESLEECKKMCTRD (SEQ ID NO: 25), 
YGGCLGNMNNFETLEECKNICEDG (SEQ ID NO: 26), 
YSGCGGNENNFTSKQECLRACKKG (SEQ ID NO: 27), 
YGGCMGNGNNFVTEKECLQTCRTV (SEQ ID NO: 28), 

3 0 YGGCQGNGNKFYSEKECREYCGVP (SEQ ID NO: 29), 

YGGCGGNENKFGSQKECEKVCAPV (SEQ ID NO: 30), 
YGGCGGNSNNFLRKEKCEKFCKFT (SEQ ID NO: 3 1), and 
YGGCRAKRNNFKSAEDCMRTCGGA (SEQ ID NO: 32). 
6. The polypeptide according to claim 5 wherein Rj represents amino acid residues MO of APPI 
3 5 (SEQ ID NO: 6) or conservative amino acid substitutions thereof; 

R 2 represents amino acid residues 20-33 of APPI (SEQ ID NO: 14) or conservative amino acid 
substitutions thereof; and 

R 3 represents amino acid residues 35 through 58 of APPI (SEQ ID NO: 23) or conservative amino acid 
substitutions thereof. 
40 7. The polypeptide of claim 6 wherein 
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Xaa^ is selected from the group consisting of P f D, and E; 
Xaa 4 is G; 

Xaa 3 is selected from the group consisting of H, P, R, and L; 
Xaa2 is C; 
5 Xaaj is R; 

Xaaj* is selected from the groupconsisting of A and G; 
Xaa^ is selected from the group consisting of A, L, N, and W; 
Xaa 3 ' is selected from the group consisting of H and I; 
Xaa 4 ' is selected from the group consisting of P, Y, L, and W; 
10 and Xaa 19 f is selected from V, Y, W, and S. 

8. The polypeptide according to claim 7 wherein the sequence Xaa 5 -Xaa 4 -Xaa 3 -Xa^-Xaa { - 
Xaa 1 , .Xaa 2 , -Xaa 3 , -Xaa 4 ' is DGHCRAAHP (SEQ ID NO: 72) and 

and Xaa 19 ' is selected from V, W and Y. 

9. DNA encoding the polypeptide of claim 1 . 

15 1 °« The DNA of claim 9 further comprising an expression control sequence operabry linked to the 

DNA. 

11. An expression vector comprising the DNA molecule of claim 1 0. 

1 2. A host cell transformed with the expression vector of claim 1 1 , 

13. A method for expressing a DNA molecule encoding a serine protease inhibitor in a host cell, 
20 comprising culturing the host ceir of claim 12 under conditions suitable for expression of the polypeptide. 

14. A pharmaceutical composition comprising a pharmaceutically acceptable excipient and the 
polypeptide of claim 1. 

15. A method for treating a mammal for which inhibiting plasma kallikrein is indicated comprising 
administering a pharmaceutically effective amount of the composition of claim 14 to the mammal. 
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